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It has been noted recently by Artin and by Baer that one obtains the funda- 
mental theorem of the Galois theory most readily by starting with a finite group 
® of automorphisms in a field P and determining the structure of P over @ the 
set of invariant elements. One proves that P is finite, separable and normal 
and @ is its Galois group over ®. The theorem is completed by proving that 
any isomorphism between subfields over ® in a finite, separable and normal 
extension P of can be extended to an automorphism in P. It follows that the 
Galois group of P over @ has © as its set of invariant elements. The correspond- 
ence between subfields and subgroups follows readily. The usual proofs of these 
theorems are obtained by strictly commutative methods (symmetric functions, 
unique factorization of polynomials). 

In the present paper we begin with an arbitrary quasi-field P and a finite 
group of outer automorphisms © acting in P and establish the correspondence 
between subgroups of @ and sub-quasi-fields 2 between P and ® the set of 
invariant elements. The methods are necessarily those of non-commutative 
algebra. The particular tool used is the theory of simple rings. We obtain 
some applications to division algebras. The first section is introductory con- 
taining results that are for the most part well known. In the last section we 
give a generalization of Hilbert’s theorem on the elements of norm 1 in a cyclic 
field. 


1. By a transformation 7 we shall understand a single-valued correspondence 
between the elements of a set P and those of a subset. We denote its effect on a 
by aT. T and S are regarded as equal only when they have the same effect 
on all a’s of the set. The product ST is defined by a(ST) = (aS)T. If Pisa 
group written additively S + T is the trausformation such that a(S + T) = 
aS + aT and T is an endomorphism if (a2 + 8)T = aT + BT. With these 
conventions it is readily verified that the set of endomorphisms of a commutative 
group form a ring with an identity, namely, the identity transformation. The 
zero element is O given by aO = 0 and the negative of T is a(—T) = —af. 
If T is (1 — 1) its inverse is also an endomorphism. When we speak of an 
endomorphism of a ring P we shall mean an endomorphism of its additive group. 

A commutative group P in which there is defined a set of endomorphisms 


1 Presented to the Society April 7, 1939. 
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forming a quasi-field ® which includes the identity transformation is called a 
vector space. P has finite dimensionality (P:®) = n if it contains n elements 
a, @2,--++,@, such that every element is expressible uniquely in the form 
aks + ake + +++ + an’n, & €@®. Asis well known nis an invariant and may be 
characterized as the maximum number of linearly independent elements in P. 
We suppose hereafter that (P:®) is finite. 

An endomoprhism A commutative with all the transformations in ® is called a 
linear transformation (I. t.) of P over ®. Their totality is a ring A. Now 
suppose a; a2, ---, is a basis for Pover®. Clearly A is determined by a,A 
and we may choose a, A arbitrarily and define a. t. by the condition (}va:t;) A = 
(a:A)E;. In particular we define E;; by a,E;; = and by a,t’ = 
for any in ®."" The totality of transformations &’ forms a quasi-field ®’ anti- 
isomorphic to ® since a,(&n)’ = a,fn = a,t’n = ant’ = a,’t’. Evidently P 
is a vector space over ®’ and since a,£’ = a,€, a1, a2, --+, @n form a basis for 
this space. It is readily seen that a,t’E;; = a,E;;t’ and hence &’E£;; = E;;t’. 
If a,A = >> ajt;, then A = >> E,,€}; and hence the set of 1. t.’s is identical with 
the set of these transformations. If >> E,,€}; = 0 then >> a,&;, = 0 and by the 
uniqueness we have &;, = 0. If we regard P as a space over ®’ and use the basis 
a;, we obtain a,t’” = a,€ and hence the |. t.’s — and &” are identical. The 
E;; are 1. t.’s in P over ®’ and the general form of such a transformation is 2 E;;€ ji. 
We note also that 


En + Ent --- + En. = 1, Ey = . 
Hence if E,.(2 = (2 Eg 
Epitig = Eigkpi 


and &;; = Oif i ¥j, &;; = & and similarly the condition that = E;;€;; commute 
with all £,, is that it belong to ®. Now if 7(T7’) is any endomorphism com- 
mutative with all the 1. t.’s = Ey,€}(2 E;,€;;) it is an 1. t. in P over ®’(@) and 
commutes with the E,,. Hence T ®’). 

If P is a ring with an identity the transformations a — at = aé& are endo- 
morphisms and their totality is a ring P isomorphic to P. The transformations 
a — ta = at’ are endomorphisms and form a ring P’ anti-isomorphic to P. 
The elements of P commute with those of P’. Furthermore if 7’ is any endo- 
morphism commutative with all the — and 17’ = 7, then a7’ = (la)T’ = (1a)T’ 
= (17’)e = ra = ra = ar’. Hence T’¢P’. Similarly P is the complete set of 
endomorphisms commutative with those of P’. 

If P > @ > 1 and @ is a quasi-field the transformations corresponding to the 
elements of form a quasi-field ® and P is a vector space over ®. If left multi- 
plication is used in place of right multiplication we obtain a quasi-field ®’ and 
P is a vector space over ®’. Thus, for example, the set of 1. t.’s of any vector 
space over ® contains ®’ and may be regarded as a space over ®’. Now suppose 


1@ Note that the & as well as the E;; depend on the basis a1, a2, ---, an. 
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E is a space over P and P 2 ® > 1 where the dimensionalities (E: P) and (P:®) 
are finite. Ifa, a2, ---,a, is a basis for E over P, & , &, ---, & one for P 
over ® then §;; = a;€; form a basis for E over ® and hence (E:®) = (E:P)(P:@). 

We recall also the definition of a semi-linear transformation 7 with automor- 
phism S (in P) as an endomorphism in E such that (a&)T = (a7)é* for every 
Ein P. 


2. Let P be an arbitrary quasi-field and @ a group of n automorphisms 1, 
S,T,--., Uin P ((a + = + (aB)§ = We suppose that no 
S ¥ 1is inner, ie. has the form a > £ "at. Let (P, S) denote the set of endo- 
morphisms of P generated by the transformations & of P, a > at = a€ and the 
automorphisms S. (P, S) consists of the sums 2 Sf; and &S = Sk* where 
corresponds to Suppose 


(1) Sés + T&r + --- + U&y = O. 


If §; ~ O # Er we multiply (1) on the left by n and on the right by &'n°ts 
and subtract to obtain 


T(n"Er — Erts'n fs) + --- = O 
which has at least one term less than (1). Since 7S™ is not inner there exists 


an 7 such that 


and hence we may suppose that the new coefficient of Tis ~ 0. Continuing 
in this way we get finally a single terms S{s = 0 where {s ¥ 0. Since Sand fs 
have inverses this is impossible. Thus the space (P, S) over P has n dimensions. 

(P, S) is a simple ring with anidentity. For if 8 is a two-sided ideal and con- 
tains )> Ss ~ O the above argument shows that 8 contains some Sfs , fs ¥ O. 
Then 8 > land 8 = (P, S). Any right ideal 3 of (P, S) is asubspace over P. 
It follows that (P, S) satisfies the chain conditions for right ideals. Hence the 
following theorems hold.” 

Any two irreducible right ideals are similar, i.e. there exists a (1-1) correspond- 
ence X — X between 3 and § such that 


X+Y={X+Y XA=XA 


forany Ain (P,S). In particular, if we restrict A to be in P we see that X ~ X 
is a (1-1) 1. t. between 3 and § and hence (9:P) = (3:P). Since the elements 
(1+ S+7+4...+ U)é, variable, form a right ideal of dimension 1, we see 
that (9:P) = 1 for every irreducible %. 

(P, S) = 3. + 32+ --- + 9, (direct sum) where the 3; are irreducible right 
ideals. Hence n = ((P, S):P) = 7(9;:P) =r. It follows that (P, S) = W,a 


2 See v. d. Waerden, Moderne Algebra, II, pp. 149-177 or Deuring, Algebren pp. 8-25. 
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matrix ring, i.e. the elements of (P, S) have the form > £;;;; where the ’s 
form a quasi-field, E;; # 0, commute with the t’s and satisfy 


(2) Ey + Ex +--- + En =1 = 


Let a be any element # 0 in P. By (2) there is an E,, such that aE,, ¥ 0 
and since wE,, = aE,;Ei, , this implies that no a; = aE£,;is0. Suppose 


+ oath, + + = 0. 


Then (aE pth + aE + = (aE = 0 which implies 
u; = 0. On the other hand since 4, includes P, any element y in P can be ob- 
tained from a fixed 6 ¥ 0 by operating with a suitable > E;;4;;. Thus 


= (aE pp) = + + anthn 


Hence the a; form a basis for P over W and from their definition we have 
a, Ej; = Sina; 

As in §1 we associate with each ¢ the transformation yy defined by (>> aahs)¥’ 
= >> aw); and denote the totality of these transformations as W’. P is a vector 
space over W’ with a basis a1, az, --- ,a@, as a basis. The transformations 
>> E.,:; form the complete set of |. t.’s in P over W’ and the elements of 
may be characterized as the endomorphisms commutative with all the }> Ei,hi; . 

On the other hand W, = (P, S) and if an endomorphism commutes with all of 
P it is in P’ the set of left multiplications a — ta = at’ and if in addition it 
commutes with all the S’s we must have £in®. We have therefore shown that 
w’ = @’ and (P, S) is the complete set of |. t.’s of Pover ®’. If we use (P’, S) 
in place of (P, S) we obtain (P:®) = n and (P’, S) is the complete set of]. t.’s 
in the vector space P over ®. Hence we have the important theorem. 

THEOREM 1. Let P be an arbitrary quasi-field, © a finite group of n outer 
automorphisms acting in P and ® the sub-quasi-field of invariant elements. Then 
the order (P:®) = (P:@’) = n and (P, S) the set of transformations = S€s is the 
complete set of linear transformations of P over ®', (P’, S) the complete set of linear 
transformations of P over ®. 

Suppose V is any automorphism in P leaving the elements of ® unaltered. 
Then Vis al. t. in P over ®’ and hence V = 2 Sts. For every 7 we have 
nV — Vn’ = O. Hence S(n°s — Esn’) = O and if 0, = 
Since no § ¥ 1is inner this holds for just one S and V = SA where A is inner. 
A must leave the elements of ® invariant. Hence if a4 = ¢ ‘até, € commutes 
with all the as in ®. 

If © is a subgroup of G we denote the sub-quasi-field of elements invariant 
under the transformations of by P() and if 2 is any sub-quasi-field between 
® and P we denote the subgroup of © leaving the elements of = invariant by 
G(z). Note that @ < P(H) < P, P(G) = @ P(1) = P. The following is the 
fundamental theorem of the Galois theory. 

THEOREM 2. The correspondences $ — P() and 2 — (Zz) are inverses of 
each other. Either one sets up a (1-1) correspondence between the subgroups of © 
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and the quasi-fields 2 between @ and P. The order (P:%) = (P:’) = order of 
and (Z:@) = (Y:@’) = index of 

Let © be a subgroup of G and P() the set of invariant elements. If T is an 
automorphism of & leaving the elements of P() invariant, T = SA where S 
isin § and Aisinner. Since A = ST is in G this implies that A = 1, S = 
Thus G(P(H)) = Now suppose is given where @ S S P and 
let A be the set of 1. t.’s of Pover =’. ThenA <(P,S). If=SkeA 


Sv*ts = Sts = Sts) = Sy'ts 


Hence >> S(u* — y')&s = O. The process used above shows that &s = O unless 
= forall in i. e. unless SeG@(Z) = Hence A consists of the trans- 
formations p>» Sts. =’ is the complete set of endomorphisms commutative with 


those of A. On the other hand the form of the elements of A shows that these 
transformations are precisely the ¢’s such that P(). Hence P(@(z)) = &. 
The order relations follow from Theorem 1. 

If = = P(H), 2° = P(S'“HS). Hence § is invariant if and only if is 
transformed into itself by all the elements of G. If 1, S, .-- are representatives 
of the cosets of §, the transformations induced in = by these elements are dis- 
tinct and depend only on the cosets. Their totality is a group G@ y G/S. 


3. If P is commutative, £ any element in P, let &, &, --- , &” be its distinct 
conjugates. The coefficients of 


(x — t)(a — &) (a — £7) 


are invariant under @ and therefore belong to ®. Hence every element of P 
satisfies an equation with coefficients in # and with distinct rootsin P. It follows 
that P is a separable, normal extension of 6. To complete the Galois theory for 
finite extensions along these lines one needs only the converse theorem that if P 
is a finite, separable, normal extension of &, then the Galois group leaves in- 
variant only the elements of ©.° 

Let I be the center of P and A = [' (1) ®. The automorphisms S induce 
automorphisms S’ in T and A is the set of invariant elements relative to &’ the 
set of S”’s. Suppose first that the order of @’ = order of © = n. I the join 
of ® and is invariant relative to 1 only. Hence = P. If yi, v2, Yn 
is a basis for I over A every element of P has the form yigi + yae2 + +--+ + 
Ynn, $i in ® and since (P:®) = n, these elements form a basis for P over ®. 
Hence if we regard P, @ and I as algebras over A we have P= XT." It is 
then evident that the center of 4 is A, i. e. @ is a normal division algebra over A.” 


3 vy. d. Waerden, Moderne Algebra I, p. 150, 1 st ed. or p. 161, 2nd ed. 

‘ If P is an algebra over A with an identity 1, @ and r subalgebras containing 1, P= ® XT 
means that 1) P = 2) the elements of commute with those of and 3) if , ¢2, @m 
are linearly independent in @ over A and y: , yz, °** » Yr are independent in I over A then 
¢iys are independent in P over A. 

5 Since P = ® X I any element commutative with all ge@isin lf. It follows that if 
Vis any automorphism leaving the elements of # invariant then V « G. 
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The converse of this is clear. If P = @ X I where ® is a normal division 
algebra over A and I is a separable, normal field over A then the Galois group of 
I has order n = ([:A). These automorphisms have a natural extension in 
P = ® X I obtained by requiring that they leave the elements of ® unaltered. 
The automorphisms thus obtained are outer. 

THrorEM 3. Let P be a quasi-field, T its center. A necessary and sufficient 
condition that P have a group of n outer automorphisms which induce distinct 
automorphisms in T is that P = & X T where & is a normal division algebra over Aa 
subfield of Y and T is separable and normal of order n over A. 

If (P:1) is finite any automorphism leaving the elements of [ invariant is 
inner.” Hence in this case the group @’ is necessarily of order n. 

THEOREM 4. If P is a quasi-field such that (P:1) is finite, a necessary and suffi- 
cient condition that P have a group of n outer automorphisms is that P = ® XT 
where ® is a normal division algebra over A a subfield of T and T is separable and 
normal of order n over A. 

Now suppose I is a p-adic field’ and (P:F) = m?. Then (®:A) = m’andbya 
known theorem on p-adic algebras @ X TI is a division algebra if and only if 
(m,n) = 1.° Hence we have the theorem. 

Turorem 5. If P is a quasi-field with center T a p-adic field and (P:1) = m’, 
then any group of outer automorphisms has order prime to m. 

In particular P has no outer automorphism S of order | a prime divisor of m 
since S, S’, ..- , S' would then be outer automorphisms. 

We return now to the general case. Let A = I fl &, G’ the group of induced 
automorphisms and © the subgroup of elements T such that y” = y for all 
yin I. is an invariant subgroup of and = G/H. If = P(H), 
P = = = @ and the automorphisms S of © induce automorphisms § in = 
which form a group © = G/. Since = = I the S are outer and induce distinct 
automorphisms S’ in I’. Theorem 3 may then be applied to 2. We shall not 
attempt a more refined analysis of the structure of P here. We merely give a 
very simple example to show that P may be > 2. 


4. The following example due to Kéthe shows that the automorphisms of & 
need not be distinct inT.° Let Ty be any field of characteristic ~ 2 and T = 
To (1, Yr, %2, Y2, --- ) Where the z; and y; are distinct indeterminates. Let 
P™ = (a, ys) X (x2, X X (tm, Ym) where (zx; , y:) is the generalized 
quaternion algebra with basis 1, &;, 9: , &: such that 


m=yi, tim = 


is a quasi-field and hence so is P = = them 
am — £n'Qmem is an automorphism S,, of period 2 in P“ leaving the elements 


Deuring p. 48. 

7 i.e. a finite extension of the field of ordinary p-adic numbers of a field of power series in 
one indeterminate with coefficients in a finite field. 

8 Deuring p. 113 and p. 137. 

® “Schiefkérper unendlichen Ranges iiber den Zentrum,” Math. Annalen 105, p. 24. 
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of f unaltered. S,, has the same effect as S,_, in P“”. Hence S = lim S,, 
is defined and is an automorphism in P. S is not inner, for if aS = ¢“'et, 
¢ is in P‘” for N sufficiently large and hence 


s 


which is impossible. S’ = 1 and S = linT. Since ¢2 = 2x22 --- 2 the 
dimensionality over T of &” = P™ () dis 2°"-.” It follows that 1, m is a 
basis for over and hence for P over ®. 


5. Let E be a vector space over P, (E:P) = r and 7, = 1, 7s, ---, Ty a 
finite group of  semi-linear transformations such that the automorphisms S 
in P associated with Ts (ETs = Tst*) are distinct and outer. Let ® denote the 
set of invariant elements of P and H those of E. H is aspace over ®. The 
transformations )> Tsts form a ring isomorphic to (P, S) defined above." 
Hence these transformations may also be represented in the form }> Ej ji; as 
in §2. If ¢ = e£,, ¥ 0 then any vector in E’ = ¢ )> Tsts has the form 
fii + Sette + --- + where = andy;«W. It follows that if u ~ 0 
is any vector in E’, uW, = E’. The transformations induced by the }~ Tsts 
in E’ form a ring isomorphic to (P, S) also. Since E = ae ~ 0 there is a u 
in E’ such that = » 0. Then ne H and ¢ = 7 D_ Tsts = E in P. 
Since any = + + --- + €£n,, € is a linear combination of elements 
n: in H with coefficients in P. Hence E has a basis m, m2, --- , m over P. 
If » = 2n.€; is in H it follows that &; « ® and m, m2, --- , 7, form a basis for H 
over ®. 

THEOREM 6. Let E be a vector space over P, (E:P) = rand T, = 1, Ts, --- , Ty 
a finite group of semi-linear transformations whose induced automorphisms 
1, S,--- , Uin P are distinct and outer. If H is the set of vectors invariant under 
the Ts , ® the set of elements invariant under the S, then H is a space over ® such 
that (H:®) = r and the extension HP = E. 

We recall that if 7's is the matrix of 7s relative to the basis «, €, --- ,€, 
i.e. (aTs, ---,€-Ts) = (4, €,---, &)7s, then the matrix relative to a 
second basis @,---,€)Ais A The conditions = Ty when 
ST = U are equivalent to 777§ = 7st. Since the matrices of the Ts are all 1 
relative to (m, m2, ---, 7) we have proved 

TurorEM 7. If @ is a finite group of outer automorphisms in a quasi-field P 
and T's are matrices with elements in P such that T; = 1, TsT'$ = Tr, then there 
exists a non-singular matrix A such that Ts = A~'A® for all S. 


University oF NortH CaRoLina. 


10 If K is a subfield of a normal division algebra P and A is the set of elements commuta- 
tive with those of K, then (P:) = (K:F)(A:r). See Deuring p. 44. 

1 Evidently 2 Sts > » Tsés is a homomorphism. Since (P, S) is simple this corre- 
spondence is an isomorphism. 

12 Jacobson, ‘‘Pseudo-linear transformations,” Annals of Math. 38 (1937) p. 486. 
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ON A GEOMETRICAL REPRESENTATION OF p-ADIC NUMBERS 


By Kurt MAHLER 
(Received December 30, 1938) 


Let ¢ be a real irrational number, 


1 1 
= ao + Al + 
a | de 
its simple continued fraction, and 


the sequence of its approximations. The following theorems have been proved: 
THEOREM OF LAGRANGE: If p/q is a rational number, such that 


J Qn 


of 


for a certain index n = 1, then|q| = dn, with equality if and only if 7 = i: 


THEOREM OF Hurwitz-BoreEL: For at least one of any three consecutive indices n 

V5 

but to every « > 0, there is an irrational number £, such that 


1 -2 
for all sufficiently large n. 
TueoreM or Kuintcuine: There are arbitrarily large positive integers t, for 
which the inequalities 


< 


Qn 


at 


t 


1 
t|q|’ 


have no solution in integers p, q. 


10. Perron, Die Lehre von den Kettenbriichen, Leipzig-Berlin 1929, §15. 
2]. 1, §14. 
3 J. F. Koksma, Diophantische Approzximationen, Berlin 1936, p. 36 f. 
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THEOREM OF TcHEBYCHEFF:' If ¢ is an irrational, 3 an arbitrary real number, 
then there are arbitrarily large positive integers t, for which the inequalities 


1 t 
lag —p lal 


have an integer solution p, q. 

(Though the statement of the last two theorems does not mention continued 
fractions, their proof is much simpler if these are used.) 

In the present paper, I derive analogous results for P-adic numbers, when P 
is an arbitrary positive prime number; see the Theorems 17-26 in Part II. 
For this purpose it is necessary to develop a P-adic algorithm similar to that 
of the continued fractions in the real field. In two earlier papers,’ I have 
previously studied and applied an algorithm of this kind; the method there was 
purely arithmetical and based on Minkowski’s Theorem on linear forms. It 
had, however, some disadvantages, e.g. it did not lead to analogues to the 
Theorems of Hurwitz-Borel and Khintchine with good values of the constants. 

For this reason, I shall use in this paper a geometrical method, by which the 
best approximations of a given P-adic integer ¢ can always be obtained. (The 
restriction to P-adic integers is unimportant, since, if necessary, 1/f instead of ¢ 
may be considered.) This method forms something like a P-adic counterpart 
to Hermite’s method of introduction of continuous variables in the theory of 
forms.® It is based on the following idea: 


Let 
T, = (na) (n = 0, 1,2, ---) 
be an infinite sequence of integer matrices with the following properties: 
(a): — = 
(b): (dn Qn) = 1. 
(c): All matrices T7'Tn41 = Quy have integer elements. 


Then it is easily verified that to every n there is an integer A, , such that 
A, S P*—1, pa t+ = 0 (mod P"), pr + = 0 (mod P"), 
and such that 


Anu = An(mod P”). 
There exists, therefore, a P-adic integer ¢ as the P-adic limit 
¢ = lim A,. 


‘]. ¢. 3, p. 76 f. 
5 Nieuw Arch. Wiskde (2), 18 (1934), 22-34, and Mathematica B (Zutphen), VII (1938), 5 p. 
° Ch. Hermite, Oeuvres, vol. 1, Paris 1905, 100-163, 164-193, 200-263, and 1. c. 3, p. 40 f. 
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This number is determined uniquely by the sequence {T,}, and it is related to 
its elements T, by the inequalities 


SP", |pn tangle 


and the equations’ 
( “) or lim ( ; ) f 
qd: Qn 


We say that the sequence {T,} defines ¢. Conversely, every P-adic integer 
can be defined by a sequence {T,,}; it is sufficient to take 


(n = 0,1,2, +++), 


where A, is any integer for which 
Then (Any: — A,)/P" is an integer, and therefore also all matrices 


= ( (Anu ) 
are integral. The three conditions (a), (b), (¢) are therefore satisfied. 

Let us consider two sequences {T,} and {T,} as equivalent if they define 
the same number ¢. This is the case if and only if to every index n = 0, 1, 2, - - - 
there is an element P, of the modular group (i.e. an integer matrix of deter- 
minant 1), such that 


= T,P,. 


Among all equivalent sequences {T,,}, which define ¢, we can choose one as the 
reduced sequence. This we may do, for instance, in the following way: 

Let ®(X, Y) be the distance function of an arbitrary convex domain 
(X, Y) S 1 of area J in the (X, Y)-plane with center at the origin; thus 


#(0,0)=0, (X,Y)>0 for X°+ Y’>0, 
b(tX, tY) = | t| &(X, VY) for real ¢, 
+ X2, + Yo) S Yi) + Vo). 


With every element T, = (?-?») of {T,} we form the new convex function 


®,(X, Y) = + prY, + qnY); 


7 One of these limits may be different from ¢. 
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the domain ,(X, Y) S 1 is again convex with center at the origin, and its area 
is P-"J. Suppose now that for every n = 0, 1, 2, --- the function #,(X, Y) 
satisfies the inequalities 


(d): #,(1,0) = 1) S @(—1, 1) S 4,(1, 1); 


then we say that the sequence {T,} is reduced. By Minkowski, it follows 
from (d) that 


2P" 4P 
(e): J ,(1, 0)#,(0, 
Hence in particular, for all indices n simultaneously 
9p"? 
: n n any Yn) => 


It is not difficult to show that there is one and only one reduced sequence which 
defines ¢; by (f), the elements of this sequence lead to approximations — p,/qn 
for ¢.—It can also be shown that if {T,} is reduced, then every term of the 
allied sequence 


= Te Tent (n = 0,1, 2,---) 


belongs to a finite set of matrices, which depends only on P and on the function 
#(X, Y), but not on ¢. By means of this result, it is possible to determine 
indices n, for which the inequalities (f) can be improved, provided that such 
indices exist; it is further possible to obtain results analogous to the Theorem 
of Khintchine. 

This is carried through in the present paper for the case in which 


#(X, Y)? = AX’ + 2BXY + CY’ 


is a positive definite quadratic form of determinant 1. In this case, the in- 
equalities (e) and (f) can be improved a little. It is more important, however, 
that the conditions (d) now become the well known conditions for a reduced 
positive definite quadratic form, viz. 


—A, 2B, <A, SC,, 
if 
®,(X, yy = A,X’ + 2B,XY + C,Y’. 
Therefore, if in the usual way’ we represent the form ®,(X, Y) by the point zn 


in the upper half of the complex plane which satisfies the equation ,(z, 1) = 0, 
then z, will lie in the fundamental domain F of the modular group. Hence, 


* Geometrie der Zahlen, Leipzig-Berlin 1910, 193-196. 
* P. Bachmann, Quadratische Formen II, Leipzig-Berlin 1923, p. 17 f. 


3 
fi 
of 
. 
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as a geometrical representative of ¢, we get an infinite set of points z, in F. 
Two consecutive points of this set are connected by the equation 


+ 
Zn = 
+ Bas 


if Qa = ‘ena Ys . As we have already remarked, the matrices 2,4: have 
n+1 n+1 


only a finite number of possibilities independent of £; it is quite easy to find 
all these for a given prime number P. We can, therefore, study the elements 
of the sequence {z,} without specializing ¢; for instance, we can find lower 
bounds for lim sup yn, where = + Since 


and 

P” 2 2 

Yn Yn 
these properties of {z,} are equivalent to results on Diophantine approxima- 
tions to ¢. We give this investigation of {z,} in the first part, and the applica- 
tions in the second part, of the paper. : 

It is quite obvious that in a similar way matrices of any order may be con- 
sidered; we then shall get results on the simultaneous approximations of a 
system of P-adic numbers. I intend to study this problem and the analogous 
one in the real field in a later paper. 


Part I: THe REPRESENTATIVE OF A P-apic INTEGER 


1. The sequences Z(¢) and z(¢) 
Let I be the modular group of all substitutions 
+7 
1): 
with integer coefficients and determinant 1. Two points Z and z in the complex 
upper half-plane H, which are related by (1) are called equivalent. As is 


proved in the theory of I, to every point Z in H there is exactly one equivalent 
point z in the domain F of all points z = x + yi, for which 


(2): -As2<}, wt+y>1, o 2v+y=1; 


this domain F is called the fundamental domain of '.—It is useful to define the 
distance between two points 2: and z in F by 


(rq’ — = 1) 


ples 24) = min a+ = 1 +l); 
1 


then the neighborhood of a point 2; in F consists of all points zz in F, which are 
sufficiently near either to z; or one of the equivalent points 2; + 1, z: — 1, or 


| 
| 
| 
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—1/z,. In the following considerations, \ denotes a number in F, which is 


fixed, but arbitrary. 
Let P be a natural prime number and ¢ a P-adic integer: 


<1. 
For every n = 0, 1, 2, --- we define two integers A, and a, by 
(3): O5A,SP™-1 (Ao = 0), 
and 
— An 
(4): so that 0sa,SP-1 (ao = Aj), 
and 


lim A, 
Therefore ¢ is known if the sequence A(f) of all A, or the sequence a({) of all 
a, are given. 
By means of A(f), the sequence Z(¢) of all numbers 


(5): x, = | (Zo = 
in H is determined; obviously 
Zn + Gn 


A second sequence 2(¢) of complex numbers z, is obtained, if by z, (n = 0, 
1, 2,--- ) we understand the number in F which is equivalent to Z,. This 
sequence 2(¢) is called the representative of ¢. 


2. The matrices T,, and 2, 


Let 
6): — Gare = 1) 
QnZn + Qn QnZn 
be the modular substitution which connects Z, with z, , and put 
(7): = — Dn = — Anda- 
Then from (5) 
(8): = Prin Pn or Pn 
Qn2n + Qn — Pn 
or symbolically 


bid 
x 
= Tala; Zn = Ts. 


14 KURT MAHLER 


where T, denotes the matrix 


(9): Ta = 
n Qn 


Since 7ngn — Tndn = 1, T, has the determinant 
(10): — = P", 


and its two lower elements q, and q, are relatively prime. At least one of the 
q’s, therefore, is not divisible by P, say qx ; by p, we denote the element of T, 
above q,. In particular for n = 0, 


since Z) = 2 lies in F and therefore Z) = 2. 
The sequence T(¢) of all matrices T, has a number of simple properties. 


Obviously 
Hy 
0 1 mn Yn 


Therefore, if for n = 0, 1, 2, --- 
An+i 
(11): = Tan = ), 
nti Bn4i 
say, then 2,4; has the determinant 
Also 
= 
Qn Qn 0 1 0 1 Qntt 


or after a simplification 


Qn Qn 0 1 Qn+1 Qn+1 
so that all elements of Q,4; are integers. We denote the sequence of all Qai: 
by Q(¢). It can be determined from T(¢) by means of (11); conversely 


(13): Ty) = E, Tn = 2,0,---Q, for’ n = 1,2,3,---, 
so that T(¢) follows also from Q(¢). Obviously from (8) and (11) 


= = 
n+1 n+1len > 
+ — On41 


(14): Zn = = 


| 
| 
1 0 
| 
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Hence all 2,4; have the following important property: If 2,,:/ denotes the 
domain of all points z’ = 4:2, where z lies in F, then F and Qai:F have at 
least one common point. 


3. The theorem of finiteness 


From the last remark, we deduce the result: 

THEOREM 1: All terms Q,4; of the sequence Q(¢) belong to a finite set M(P) of 
integer matrices of determinant P, which depends only on P. : 

Proor: Since it is just as easy, we prove a more general result. Suppose t 
that Q is an arbitrary positive integer; let M(Q) denote the set of all integer meg 
matrices 2 of determinant Q, for which the two domains F and QF have at ; 
least one common point. Then we prove that M(Q) is a finite set. Now 
every integer matrix 2 of determinant Q can be written as 


= Pd, 
where P is an element of I’, and 


is an integer matrix, such that 
uw = Q, u> 0, w> 0, 0svsw-1. 


Obviously, there are only a finite number of matrices = with these properties, 
say the matrices 


Zh (h = 1,2,---,o). 
The transformed domains 
all lie entirely in the part of the z-plane given by 
V3 


| x | Q, = 
and therefore enter only a finite number of triangles of the modular division 
of the z-plane. Hence to every >, there can be only a finite number of elements 
(k = 1, 2, , px) 

of I’, such that P),2, belongs to M(Q), and the theorem follows at once. 


4. The cases P = 2, P = 3 and P = 5 of Theorem 1 


The last proof gives a method for the actual determination of M(Q) and 
in particular of M(P). We give here the results for the smallest prime 
numbers P. 

For this purpose we divide M(P) into three subsets M,(P), M2(P) and M;(P). 


iv 
0 w 
4 
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An element 2 of M(P) belongs to the first, second or third of these sets, according 
as the set of the common points of F and QF has positive area, or forms an are 
of a curve, or consists of a single point. For P = 2 both M2(P) and M;(P) 
are empty, but this is not true for any larger prime number P. 

In the following tables, the elements of M;(P) (j = 1, 2, 3) are arranged in 
such a way that any two of them stand one above the other, if their product is 
equal to + PE, and that they stand alone, if their square has this value. With 
Q also —Q belongs to M,(P); these two matrices lead to the same relation 
between points of F and therefore are not essentially different. 


Cass P = 2: 


M(2) = M,(2) has 26 elements: 


—{2 0 


M,(3) has 26 elements: 


M,(3) has 4 elements: 


i 
| 
| 
dl Cass P = 3: 
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M;(3) has 6 elements: 
7), 


2 1 
M,(5) has 58 elements: 


M,(5) has 6 elements: 


M;(5) has 4 elements: 


M,(7) has 90, M2(7) has 6, M;(7) has 10, M,(11) has 190 elements, ete. 


C 
) 
| 1 3) | 
| 
a4 
+(;° 
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5. The first existence theorem 


In this and the following paragraphs we shall prove some theorems, which 
show that every element of M(P) actually occurs in the set Q(¢) of a certain 
P-adic number ¢; if the element lies in M2(P) or M;(P), then it may, however, 
be necessary to choose \ appropriately. 

THEOREM 2: Let 


{Qnii} = , , O3,--- }; = 
Bat. 


be an infinite sequence of integer matrices of determinant P, and define a second 
sequence of matrices 


{Tr} {To, T1, Tz, --- } 


for n = 1, 2,3,--- 


Then there is a P-adic integer ¢ such that {Quis} = Q(¢) and {T,} = TS), of 
and only if (dn, Qn) = 1 and + in F for n = 0, 1,2, 


Proor: That the conditions are necessary, was already shown in §2; there- 
fore we have only to prove that they are sufficient. 

As in §2, let q, be that one of the two numbers q, and q,, which is not divisible 
by P, and pr the corresponding other element of T,,. Then there is exactly 
one integer A, such that 


Pn + =0 (mod P”), OFA, P*—1, 
and since P { Qn and — Padn = P", obviously 
Pa + GAn = 0 (mod + = 0 (mod P’). 
Therefore there are two integers r, and r, such that 
Pa = — Dn = — Anda, 
and the determinant ragn — adn = 1. Thus if 


Z, = Ant and a, = t 
QnZn — Tr 


? 


+ pr 


Zan = 
— Pn 


lies in F by hypothesis. 
I assert that 


Anu = An (mod P”). 


| 
| | 
| 
if 
( 
| 
| then 
| 
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For 
QnOnti + QnBait, + 
hence 
Dasa + QnttAn = Pati + = + 
+ Qn+tA ni = 0 (mod P"), 
from which the congruence follows immediately. If ¢ is defined by 
¢ = lim A,, 


then this limit exists as a P-adic integer and satisfies all the conditions. 
, 
Corouiary: Let T = 4 be an integer matrix of determinant P”, such 


hab ang lies in F. Then there is a P-adic integer such 


that T is the element of T(£) with index n. 
Proor: We define the integer A, by 


p+ Ang = p’ + Ang’ = 0 (mod P), 054A, s P" -1, 


and take a P-adic integer ¢, for which |¢ — A, |» < P"; then by the proof 
of the last theorem, ¢ satisfies all conditions. 


6. The second existence theorem 


THEOREM 3: Let 2* be an arbitrary number in F, a and B two relatively prime 
integers, € an arbitrarily small positive number. Then there is an index n > 0 


and a P-adic integer ¢, such that the elements T, = > Ps) of T(¢) and zn of 2(¢) 
satisfy the conditions 
adn + Ban = +1 (mod P), |Zn — 2*| Se. 


Proor: Since (a, 8) = 1, there is an integer unimodular matrix 


Q = 
Let D be the set of all points z in the upper half plane H, for which 


_ azt+a’ 


+ 


satisfies the conditions 


QzliesinF, —2z*| Se. 


Obviously this set has at least one inner point, say the point u. 


| f 
it 
=. 
“ 
aa 
| 
| 
i 
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For 7 = +1, consider the matrix 4, = » 


re ) of determinant P, and the 
corresponding substitution 
®,z. 
Obviously, this can be written as 
22’ + (1 — _ 
22’ +4 + (1 — 4P)! 


where the factor 


22 + — (1 — 4P)? 


_—(1—4P)} 
a+ (1 — 4P)! 


8 a complex number of modulus 1. Since 


#3 is not an algebraic integer, and therefore not a root of unity. Hence the 
powers 3, #*, #*, .-- lie everywhere dense on the unit circle in the complex 
plane. 

Therefore, if Z is an arbitrary point in the upper half-plane H, and K,(Z) 
the circle 


22 ++ (1 — 4P)! 
through this point, then with z also #,z lies on K,(Z), and the points 


2Z ++ (1 — 


obtained from z by repeated application of ®;* lie everywhere dense on this 
circle. 

Now it is easy to see that we can connect » with » by means of a continuous 
curve C, which consists of arcs of the two circles K_,(A) and K_;(u) and of 
circles K,(k + 3 + (1 — 4P)'), where 7 = 1 and k = 0, 1, 2,--- ; for any 
two circles K(k + 4 + (1 — 4P)') and + 44+ 1 4P)'), where 
k = 0,1,2,---andk S$ « Sk + 3/2, intersect. Let \, 4, , Ar, 
uw be the successive end points of the arcs composing C. Then by alternate 
applications of powers of ®—; and #3; , we can transform \ successively into 
points arbitrarily near to \;, \2, --- , A: , and finally into a point 2** near to u, 
which lies in D. In this way, we obtain an integer matrix 


* 


if 

| 

| 
| | 
i | 

| 
| | 


GEOMETRICAL REPRESENTATION OF p-ADIC NUMBERS 21 


where the exponents h, k are integers = 0, such that 
A= T*%s**, or z** = 


the determinant of T* is a power of P, say P”. Since 


necessarily 
0 
* 
T* = oe (mod P). 
Now put 

so that 


A=Trzn and z, = Tz A, 
and, by the definition of D, 


zn liesin F, and |z, — 2*| S«. 


agn + = +1 (mod P), 


and from prdn — GaP» = P” in particular (qn , qn) = 1. Hence, by the corollary 
to Theorem 2, the theorem follows immediately. 


We have 


therefore 


7. An application of Theorem 3 
To every element 2 of M(P) belongs a certain set S(Q) of points z in F, such 


that the transformed points 2”’z also lie in F; this set has inner points, or con- - 


sists of an arc of a curve, or of a single point, according as 2 belongs to M,(P), 
or to M;(P), or to M;(P). 

TuHEorEM 4: Let Q be an element of Mi(P). Then there is a P-adic integer §, 
such that Q is an element of the sequence 2(f). Provided that d is chosen suitably, 
this is also true, if 2 belongs to M,(P) or M;(P). 

Proor: Suppose that z* is a point of S(Q), in particular an inner point, if 2 
belongs to M,(P). IfQ@ = (: 4 then either (a, 8) = 1 or (a’, B’) = 1, for 
a8’ — a'B = P. Denote the pair, which is relatively prime, by a*, 6*. Then, 


x 
| P); 
> 


22 KURT MAHLER 


by the last theorem, we can find a P-adic integer {> for which z, is arbitrarily 
near to z*, and for which T, = os © 2) satisfies the condition 


a*q, + = 1 0 (mod P). 
Thus, in particular, if 


Qnti 


then (dni1, Yn41) = 1, since one of the q’s is not divisible by P. 

Now, if Q lies in M,(P), then we can choose { in such a way that z, lies in 
S(Q). If, however, 2 is an element of M2(P) or M;(P), then by a slight change 
of \, so that z, remains in F, we can again obtain a { for which z, lies in S(Q). 
Hence in both cases the number 


= Den = 


lies in F. Therefore by the corollary to Theorem 2 there is another P-adic 
integer ¢, such that Q and 2,4; are the elements of Q(¢) and z2(¢) with index 
1. The proof of Theorem 2 shows that necessarily — P"; 
hence T, also belongs to T(¢), and Q is an element of Q(¢). 


8. The sets m(P) and m’(P) 


Let Q = ws a be an element of M(P). Then the fix points of the substi- 


tution 


, a +a’ 


satisfy the quadratic equation 
Bz’ + (6’ — az — a’ = 0, 


and are therefore 
(15) 
26 26 
if 8 ~ 0. Let us suppose that 8 > 0, and that both roots (15) are conjugate 
complex numbers, i.e., that 2’ = Qz is an elliptic substitution. Then if 


+ — 4P)} —((a+ 8’)? — 4P)! 
(16): f= he = 


fi is the fix point with positive imaginary part, and fz the conjugate one. Ob- 
viously, z’ = Qz can be written as 


(17): Joh 


{ 
| 
| 
1) 
| 
| 
q 
eg. 
j 
| 
Si 


se 


where the multiplicator 

_%— _at+ — ((a+6’)’ — 4P)! 
a+ ((a+ — 4P)} 

satisfies the quadratic equation 


(19): (2-248 \ 541-0. 


Let us now denote by m’(P) and m(P) the subset of those elements 2 of M(P), 
which are elliptic, whose fix point f; lies in F, and whose multiplicator 3 is a 
root of unity, or is not a root of unity, respectively. 

For the elements 2 of m’(P), the coefficient 


(a + 
P 


in (19) obviously must be equal to 0, or to #1, or to #2. Hence necessarily 
P=2anda+ fp’ = +2,0rP=3 anda+’ = 43, or P is arbitrary and 
a+ 6p’ =0. By the tables in §4, m’(2) has six elements 


F(a) FC FC 


m’'(3) has six elements 


and m’(5) has four elements 


9): 


If P = 5 and @ belongs to m’(P), then a + 8’ = 0, so that 


(18): 


and therefore 9 = —PE. 
Also in the case of m’(2) and m’(3) 


and 


Hence no element 2 of m'(P) can occur twice in succession in the sequence Q(), 
since otherwise all elements T, of T({) with sufficiently large n would no longer 
satisfy the condition (qn, da) = 1. 
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For the elements of m(P), 
(20): a + B’ # 0 (mod P). 
For Q is elliptic, and therefore 
(a + —4P <0, ie. |a +’ | < 2vP, 
and so, if (20) were not true, 
P<2VP or P <4. 
The tables in §4 show, however, that (20) holds also for P = 2 and P = 3. 


9. A congruence for the powers of a matrix 


Let Q = C 7 be an element of m(P), or more generally any elliptic integer 


matrix with determinant P’ (g = 1), for which (20) is satisfied. Then the 
equation 


has two different roots ¢; and ¢e in the field of the P-adic numbers. For define 
((a + B’)® — 4P*)' as a P-adic integer by the convergent series 


(21): 


((a + — (a+ (a +6’) 


then these roots are given by 


(22): 5 5 
Obviously 
(23): ¢i = a + B’ ¥ 0 (mod P), ¢g2 = 0 (mod P). 

It is well known that for every integer k = 0 

— $2 = 

Hence, from (23), 
(25): a = (a + p’)* "2 (mod P) fork = 1. 


More generally, if T = ( , is an integer matrix of determinant P”, put 


i Th 
| (2¢ 
Th 
| (a, 
] 
| 
an 
| 
Th 
| 
to. 
| 
i: or 
an 
Su 
| 
wh 
| 
| 
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Then, from (25), 
(26): T* = (a + 6’)* 'T@ (mod P) fork = 1. 
Therefore (q*, g*’) = 1, of 
(2,8) = 1 and ag + Bq’ A 0 (mod P), or 
(a’, = 1 and a’g + 0 (mod P). 
From the definition of 7%, 

— v2)p* = gilp(a — + p’B} — — + 

(vr — = — v2) + — — + 
and therefore 


This identity shows that —p*/g* tends to a P-adic limit for k — ©, since 
|ge|» < 1, and that this limit lies in the quadratic field given by (21). 


10. The third existence theorem 


In §1, we defined a generalized distance p(z: , z2) between two points 2; and ze 
in F. With this definition, the following theorem holds: 


TurorEM 5: Let 2 = bean element of m(P), fi its fix pointin F. Then 

to every « > O there is a P-adic integer ¢ such that for all sufficiently large n 
p(Zn s €. 


Proor: Let 6 > 0 be a constant to be assigned later, and a*, 6* the pair a, 8 
or a’, B’, for which (a*, 6*) = 1. By Theorem 3, we can determine an index m 


and a P-adic integer for which T, = Pr) and z» satisfy the conditions 


dm 


+ #O(mod P), |%m—fil <6. 
Suppose that 
Th = --- Qn 
where Q , 22, --- , Q» are the m first elements of 2(¢o). 
Let us form the matrices 
* 
Qm+k YUmtk 
and the numbers 
= Dem, (k = 1, 2,3, ---), 


| 
at 
XS 

{ 
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which lie in H. From (17) in §8, obviously 
— 
Zmtk — Je 2m — Ja 
where f; and f2 are the fix points and 3 the multiplicator of 2. Since © belongs 


to m(P), 3 is a complex number of modulus 1, which is not a root of unity, 
Hence the points zie for k = 1, 2,3, --- lie ona circle K given by 


z* — fe im — fe 


and lie everywhere dense on K. To every point z* on K, there is a point z, 
equivalent with respect to I’, in F; it is obvious that we can choose 6 and there- 
fore also the radius of K so small that all these points satisfy the inequality 


p(z, fi) 
Hence, in particular, if zm+, is the point in F which is equivalent to Sate , then 
fi) S € - (k = 1, 2,3, ---). 
Now by the definition of equivalence, 
= Png (k = 1, 2, 3, 
where P,,,; is an element of [. Similarly 
= - 
From 
= 
and from the two preceding equations, 
(28): = 
where 
= 
If 
= LP mis, 


then (28) holds also for k = 0. 
Put 


T, = = T*P, = T,2” P, 


T. = = Tah 


= 1. 


| | Fo 
| anc 
| 
Bu 
{ 
| 
{ 
| | sat 
| the 
a 
| | In 
i 
Siu 
if, 
K 
ca 
| fre 
sh 
co 
ha 
* 
: 2m 
— 
| 
hi 
forn 2 m. 
Wl 
Then 


For if both g, and In were divisible by P, then the same would be true for qn 
and qn! in 


But by the last paragraph 
= (a2 + (mod P) (n => m+ 1), 


and therefore by the construction of T,, , either g, or qn’ is not divisible by P. 
It is now clear that the sequence of matrices 


To = E, Ta = --- (n = 1, 2,3,---) 


satisfies the conditions of Theorem 2. Hence there is a P-adic integer ¢ (for 
which, by the way, obviously — fo|» P-™), such that {T,} = and 
{Q,} = Q(¢). By construction, the elements z, of 2(¢) with n = m + 1 satisfy 
the required inequality p(z,, fi) «. 

Corotuary: If ¢ is the P-adic integer given by Theorem 5, then, for an infinity 
of indices, 

=2. 
In particular, these two equations are true for all sufficiently large n, if the fix 
point f; of 2 is an inner point of F. 

Proor: Let K be the circle defined in the proof of the preceding theorem. 
Since fi belongs to F, the points of K in F will form an arc of positive length; 
if, in particular, f; is an inner point of F, and e, i.e. 6 is sufficiently small, then 
K lies entirely in F. Hence in the former case an infinity of z+, in the latter 
case all zn44 with k = 0, will lie in F, and the statements follow immediately 
from the proof of Theorem 5. 

RemaRK: Since T, = T,” ™ for an infinity of indices n, formula (27) in §9 
shows that ¢ is a quadratic irrational P-adic number. Its sequence Q(f) will 
consist only of matrices Q for large indices, if f; is an inner point of F and e is 
sufficiently small; hence in this case Q(¢) will be periodic. If, on the other 
hand, fi is not an inner point of F, or ¢ is not small enough, then the points 
Zm+k Will not all belong to F, and since # is not a root of unity, it is easy to 
see that Q(¢) is not periodic for large indices. Hence Lagrange’s Theorem 
on the periodicity of the continued fraction of a real quadratic irrational number 
has no analogue for P-adic numbers. 


11. The function Y(P) 


For every element 2 of m(P), let y(Q) be the imaginary part of the fix point 
fi of Qin F, and put 
(29): Y(P) = min y(Q), 
where the minimum refers to all elements of m(P). Obviously 


(30): Y(P)= 


GEOMETRICAL REPRESENTATION OF pP-ADIC NUMBERS 27 


* 
* 
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For P < 5, the tables in §4 give 


Y(3) V2, Y(5) = 1, 


corresponding to the matrices 


with fix points 


1 -2 

1F¢V-7 
2 


and multiplicators 


-3- /-7 3-4 


THEOREM 6: Y(P) = aA if and only if P = 1 (mod 6). 
of m(P) with their fix point 


This condition will be satiafied if 


Proor: We have to find all matrices 2 = i 


Bp + (8' — a)p — a’ = 0, 
or, since p is a root of the irreducible equation p’ + p + 1 = 0, if 


a’ and p’=a+8. 


_ fe. 2 2 
(5 and at + = Qa +p) +364 = P, 
Hence either P = 3 or P = 1 (mod 6). If P = 3, then necessarily 
1 


and these matrices do not belong to m(P). If, however, P = 1 (mod 6), then 
B is not divisible by P, so that 


is not an integer; hence the matrix Q = c 


and has p as its fix point f; . 


| 
| 7 
—2 (7 —1 
= 
| +i, ] 
2 
| of 
| “4 
‘ 
i.€ 
in 
| 
| Therefore 
| 
| 
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It is very probable that 
lim ¥(P) = 


but so far I have not been able to prove this.” 
For P < 103, Y(P) is given in the following table: 


p=2 Y(P) = P =29 Y(P) = V(91)/10 = 67 Y(P) = V(8)/2 


3 V(2) 31 v(3)/2 71 5.x/(11)/18 
5 1 37 V(3)/2 73 Vv (3)/2 
7 v (3)/2 41 v(8)/3 79 v(3)/2 
11 V/(35)/6 43 ¥(8)/2 83 V(74)/9 
13 V(3)/2 47 (187) /14 89 2./(10)/7 
17 V(8)/3 53 V(91)/10 97 V(3)/2 
19 V(3)/2 59 V(55)/8 101 Vv (65)/9 
23 V(91)/10 61 V(3)/2 103 V(3)/2 


12. The function Y(¢) 


Let xz, and y, be the real and imaginary parts of the n-th element z, = 2, + tYn 
of the representative 2(¢) of a P-adic integer ¢. Suppose that 


(31): Y(¢) = lim sup yn; 


ie. Y(¢) is the greatest number, such that for every « > 0 and an infinity of 
indices 
yn = —«. 
Obviously 
(32): 2 V@)/2. 
THEOREM 7: 7'o every positive number «, there is a P-adic integer ¢, for which 
Y¢) = Y(P) + «. 
Proor: Let 2 be an element of m(P), such that Y(P) is the imaginary part of 


its fix point f; in F. By Theorem 5, there is a P-adie integer ¢ for any given 
5 > 0, such that 


1 
th 


fir) = min fi —fit1|, len —fi— 1], | +5], 


for all sufficiently large n. This implies in particular 


ly» Y(P)|S«, 


% Addendum: This equation has now been proved by H. Davenport; see the following 
paper, these Annals, pp. 59-62. 


we 
f 
i 
iil 
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if 6 is chosen sufficiently small. For all points in the domain F’: 


which are equivalent to f; , obviously have the same imaginary part Y(P) as f; ; 
therefore the statement follows immediately from the continuity of the function 


p(z, fi 1). 
Probably the following conjecture holds: 
“For every prime number P, and for every P-adic integer £, 
2 Y(P).” 

For P = 1 (mod 6), thus in particular for P = 7, this conjecture is indeed 
true, as is evident from Theorem 6 and the trivial inequality (32). I shall prove 
the following three theorems, which show that it also holds for P = 2, P = 3, 
and P = 5, and therefore for all prime numbers less than 10. 


13. The lower bound of Y(¢) for P = 2, P = 3, and P = 5 
TuHEoREM 8: For every index n and for any diadic integer ¢ 
Vv 


max (Yn, Yny2) = 


Hence in particular for all diadic integers ¢ 


2 = ve. 


THEOREM 9: For every index n and for any triadic integer ¢ 
MAX (Yn, Ynt1, Yni2) = V2. 
Hence in particular for all triadic integers ¢ 
2 V2 = Y(8). 
THEOREM 10: For every index n and for any pentadic integer ¢ 
max (Yn, Ynti) 2 1. 
Hence in particular for all pentadic integers ¢ 
21 = Y(6). 


The proofs of these three theorems depend on the following simple con- 
siderations. In 


(33): Zn = 


2,41 must be an element of M(P), and both zp = tn + tyn and Zay1 = Daga + oti 
must be points in F and therefore also in the closed domain F’: 


33, |2|21. 


| 
A 
de 
W 
| 
e. 
SI 
| 
| 3 
a 
| 
4 
fc 
0) 
| 
3 
fe 
| 
4 
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All we have to do is to determine, for every 2 in M(P), the greatest partial 
domains S, of points z, and S,4; of points 2,4; in F’, which are related by (33) 
with 2,4; = @. In certain cases, these domains may degenerate into an are of a 
circle or a segment of a straight line, or into a single point; if they contain inner 
points, then they are mapped conformally one upon the other. 

For some of the elements of M(P), one or the other of the domains S, , Sr4 
will contain only points, for which the ordinate y,(yn41) is sufficiently large, 
eg. 2 V7/2 for P = 2. In other cases, both y, and y,,1 may assume values 
smaller than the required bound, but not simultaneously, so that one of them 
always remains sufficiently large. In still other cases, both y, and Yyn4: will 
become too small at the same time. Then it is necessary to consider two consecu- 
tive (equal or different) elements 2,41 , Qn42 of 2(¢), one of which is equal to Q, 
and the three numbers Zn , Zn41, 2n¢2 = + connected by 


As in the simpler case above, there will be the three greatest domains S,, Si4:, 
Si.42 of points Zn , Zn41, Zn42 in F’, which are connected by (34). The necessity 
for considering these sets of three domains will arise only in the proofs of The- 
orems 8 and 9. In all instances it will be easy to show that at least one of the 
numbers Yn, Yn+1, Yn42 iS always sufficiently large. Only those combinations 
Qn41, Qn42 have to be considered for which the simpler method fails; and when 
the product 0Q,4:2,42 is divisible by P, then also this combination can be excluded 
(compare §8). 

Matrices 2,41 and —Q,4: lead to the same relations (33) or (34), so that only 
one of them has to be considered. Also our method is symmetrical in y, and 
Yn+1 in the case of a single matrix, and symmetrical in y, and y,+2 in the case of 
two matrices. Hence in the first case, if one of the ordinates is sufficiently large 
for Qn4, = Q, then the same will be true for 2,4, = +PQ™; and in the second 
case, combinations , Qn42 lead to the same bounds as 

We arrange the steps of the proofs in the form of tables, so as to render them 
clearer. 


14. Proof of Theorem 8 


of M(2) satisfy the equations 
-1 “\-1 0O/’ 
-1 1f\-1 \-1 -1/\-1 1/ 


The elements 


A : 
on 
od 
ve 
3, 
| 
¥ 
1 
-1 -1 1 0/7’ 
| 
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Hence at least one of any two consecutive terms of Q(f) is different from these 
four matrices. 
For the remaining elements of M(2), the inequality 


7 


max (yn, = 


is satisfied, as follows from the following table: 
Similar 


results for 
the matrices 


Inequalities for 


max (|Zn|, |Zn41!) 
> v2, 
| 3, 
( 0 2 | ngs] s 3, 
0 
hence 


max (Yn, Yn41) 


0 |an| $4, |2n41 | S 3, 0 


be 1) 54, S—}, (= 1 (5 
01 0 


-2 
2| $2, | — 1| 2, 
1 |2n| 21, | + 1| 2 2, 0 (: 2 
In S 4 S 4 0 


15. Proof of Theorem 9 
We divide M(8) into three subsets S; , and where S, consists of the 


6 elements 


Se of the 4 elements 
1 —1 
(Gi 


( 
(- 
| 
— PQ +1 
| 
1 
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and S; of the remaining 26 elements of M(3). The relations 


-1 —1 -—2\ 3(— 
2 1 2 0 
show that at least one of any two consecutive terms of 2(¢) does not belong to S; . 
Similarly it is evident from the equations 


1 1\)_,/0 1 
-1 1f\-1 1) “\-1 O/’ 
-1 1f\-1 -1/ -1 - 1 -1/ OP’ 
1 2\/-1 -2)_,/(1 
2 1f/A\-1 “\-1 1)’ 
that two consecutive terms of 2(¢) cannot have the form 


: -1 2 
Qn42in S; or in Si, = (=I 


In the twelve remaining cases, we obtain the table shown on page 34. 
Next, it is obvious from 


that two consecutive elements of 2(¢) cannot be 


2 —-1 2 


(2, 


e 
WS 2 1\/-1 1\_.,f/-!1 0 
Deel -1 1\/-1 1\_.,/f0 -1 
| 2 1/)\-1 “\-1 
-1 1 4 
| -i a\Vf-1 -2 


= 
= 
= 
= 
= 
i=) 


34 


— 575 — 
~—+-— =z 
‘ $ uf T u 


JO} 


IO} 


gl 
| 
4 
4 
| 
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#4. 4 


It is, however, possible that = Quis = 


or 


+2 _ —2ny2 + 4 Zn42 + 2 
— + 1 — — — 2n+2 + 1° 
Then 
Yn = You = 
(2rn+2 + 1)’ + (In42 1)’ + 


Suppose that S V2, S V2, so that 


3Yn 
(tn42 Yn42 = v2, (an42 1) Yn+2 mss) 


V2 


Since 3/2 S yni2 S V2, and (3, ) assumes its minimum in any inter- 


val in one of its end points, 


Therefore —3} S 2as42 S O, and finally 


2 2. 
A similar proof holds if 


2 
Our discussion has now exhausted all cases in which two consecutive elements 
of Q(¢) belong to S; or Sz. There remain the elements of S; ; for these the 
table shown on page 36 is obtained. 
16. Proof of Theorem 10 


For P = 5, it is not necessary to consider two consecutive elements of 2(¢), 
as we shall find that the theorem holds in its stronger form with only two 


ordinates. 
M(5) has 68 elements (see §4). Of these, only the following eight 


require particular examination. 


' 
i 
—— 
| 
j 
} 


36 


0 T =. — = tug 

0 I-\ I-)\| (0 I- Sig we eS 

0 T O\\/e- 0 su T= 0 

‘ < — 
0 su 0 
<— ) 
< 
= ("A 


10} S}[NSer 


10} 


2 2 
+ 2)? + 


—-1=8 
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+ 2)? + (van ~ (3) 


1=-— 


Now every point Zn41 for which 

lies inside the circle 

— $) + yar = 8)’, 

and every point for which 

Stu 0, | | 2 1, Yau < i, 
lies inside the second circle 

+ 2)? + — = 


Hence in the first case | z,| < 1, so that zn.4: does not belong to Sy4: ; in the 
second case y, > 1. This proves that max (yn, Yni1) 2 1 generally. The 


“ag —] 2 
same result holds for ( 1 Pu 7-—o ( 2 ) 


Similarly, if = 4 , then 


2 
2 2 
3 3 
3 2 2 
(2ans1 — 1)" + 
1\ 
(2-3) + (8) 
(Qatnsi — 1)? + 
Hence again the points 2,4: for which 
—} 0, | | 2 1, Yau < 1, i.e. + > 0, 


le. 


Yn —1 = —4 


do not belong to S,41 , and y, > 1 for all points 2,41 for which 
OS S43, 21, yor <1, ie. (ans: — + (Yeu — — <0. 
Thus always max (yn, Yni1) 2 1, and the same result holds for 


For the remaining 60 elements of M(5), the table shown on page 38 is obtained: 


f 
= 
i 
| 
| 
| 
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I- 


5 


-) ‘(c— 


| 


if | ug | 


‘CA 


= 
("A 
gousy 
| tux | = | “ar | 


> 
= (| | “| |) 


IO} 


10} 
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17. A characteristic property of a rational number 
Let us return to the case of a general prime number P, and divide the elements 


2 = * ri of M(P) into two sets M’(P) and M’’(P), such that 


B ~ 0 for the elements of M’ (P), 
and 8 = 0 for the elements of M’’(P). 


THEOREM 11. Suppose that ¢ is a P-adie integer. A necessary and sufficient 
condition that all elements of Q2(£) with sufficiently large index belong to M'’(P), 
is that ¢ is a rational number. 

Proor: A) The condition is sufficient. Suppose that 2, belongs to M’’(P), 
say for all indicesn 2 m-+1. Put 


Quin = Qa, 
so that 
Ze = 


where a 2 0, b 2 0 and Q are rational integers, and where a + b = n — m. 


Since both z» and z, lie in F, and 
z, = — 


Obviously 


necessarily 
(35): lim b = +0. 


n Qn Gm P* dm Qamn + 
and therefore by §2 


P*(pm + = 0 (mod P"), ie. pm + = 0 (mod P™*). 
Hence, from (35), 


But 


= Paadie tim (—?*) = 
Qm 
as was to be proved. 
B) The condition is necessary. Suppose that ¢ = —p/qg, where q = 1, and 
(p,q) = 1. Since ¢ is a P-adic integer, (P, q) = 1. 
To every index n, there is an integer A, , such that 


0<A,<P"-1, p+qA,=0 (mod P’), 


| 
| 
| 
j 
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and a second integer r, , for which 
p+qA,=1,P”. 

Hence 

(36): (q, a) = 1, 


since every common divisor of q and r, is a divisor of p. 
From (36), the equation 


(37): — = 1 


has integer solutions; if = = rs is one of them, then the general solution 
is given by 


qn = + kg, =rat 


where k is an arbitrary integer. We assume that this integer k is chosen such 
that the real part of the complex number 


qn 
satisfies the inequalities 
— 435 RE.) < }. 


Obviously the imaginary part of ‘the complex number aw is positive. 


Hence, if n is sufficiently large, z, is a point in F. 
Now, if as in §1, 


An +h _ —(p — gs) +7, P" 


qP* 
then it is easily verified that 
(39): on Tate 
+ Qn 


Hence z, is the z, of §1 for all sufficiently large n. 
From (38), for two consecutive indices n and n + 1, 


(40): ent + 


Now Any: = An + a,P” with an integer a,, hence 
and p+ q(An+ = 
and therefore 


te = — 


| 
) 

i 
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Substituting this for r, in (37) and subtracting the same equation with n + 1 
in place of n, 


= Gn+1) + — 7a, — rn) = 0, 


and finally from (7ni1, q) = 1, 
— = 0 (mod 4). 


Hence «4: in (40) is an integer, and the matrix 


Qu = 

P 

of the transformation (40) is indeed an element of M’’(P), as was to be proved. 
Coro.uary: For a rational P-adic integer ¢, 


(41): lim yn = +0. 


Proor: Immediate from (88). 


18. A characteristic property of an irrational number 


As we shall now prove, formula (41) is not true for irrational P-adic numbers. 
Put 


(42): y(¢) = lim inf yn, 


so that 


V3 


THEOREM 12: If ¢ is an irrational P-adic integer, then for an infinity of indices n 
yn VP, 
and therefore 
(43): S VP. 
Proor: By Theorem 11, there are arbitrarily large indices n + 1, for which 


= does not belong to M’’(P), so that Bai ~ 0. Hence 
B n+l 


On+12n41 + Qn+1 


en = 7 
Bn4izn41 + Bai 


can be written as 
(Bn412n On41) + —P. 


Butt Basi | Bn: | 


Therefore 


| 
0 
Pa 
| 
| 
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or 
| | 
and the statement follows immediately, since | 8,4: | 2 1. 
The question arises, whether the constant ./P in Theorem 12 is the best 
possible. This question is answered in the affirmative by the following 
TuEorEM 13: To every « > 0, there is an irrational P-adic integer ¢, for which 


Proor: Let g be a large positive integer, and Q the elliptic matrix 


0 pet 
(_. 1) 
of determinant P”’*'; we suppose that g is so large that the fix point 


_ —-1+(1 — 


min (yn, yay) S 


of 2 in F satisfies the inequality 


where 6 is a positive constant to be assigned later. 
By Theorem 3, there is a P-adic integer {) and a positive integer m, such that if 


Tn = ‘on P *) and zm are the elements of T(¢) and 2(¢) of index m, then 


(44): 0 (mod P), —fi| S 6. 
Put 


ok 
= 4 


1 1,2,---, 
for h=g+1 


1 


and forn => m+ 


*, 


T* = (?: = Ome 


q (3; for h=1 
1 4 
for h = 2,3, 
| P g and k = 0, 
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Since 


* * * 0 P\f/1 1\/P 
Qm+k (2941) +1 +2 = _1 oo P 0 P)\\o 1) 


obviously 
* ok 


and in particular 


= Ta. 
Hence, by formula (26) in §9, 


= = (— 1)! (mod P). 


From the first condition (44), therefore, (q% , g2’) = 1 for n = m+ k(2g + 1), 
and this must remain true also for all other indices n => m + 1, since T> is a 
left-hand divisor of all T7441) With greater index. 

Let zx be the complex number 


= = Ta A (n> 


so that in particular 


* —k * 
2m-+k(29-+1) 


As in §10, since @ has the fix point fi , 
| — fil S fork = 0,1, 2,---, 


and therefore 


t be fork = 0,1,2,---, 
where both 6; > 0 and é: > 0 only depend on 6 and tend to zero with 6. In the 
second inequality 

~ 
fi 


Suppose now that e < 3, and that 6 is so small that 
max (6 + 6,5 + < «. 
Then, from the definition of g and from the last formulae, 
| — forh = Oorl, k = 0,1, 2, ---, 


so that in particular all points 


* * 
Zm+k(2041) » 


lie 
| 
| (4 
| 
st 
! 
} 
| 
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lie in F, and their imaginary parts satisfy 


(45): 2 VP — « for h = 0or 1, k = 0,1, 2, ---. 
The remaining z, are given by 
for h = 2,3,---,g 


for h=g+1 
for h=g+2,9+3,---, 2g 


andk =0,1,2,---, 


so that for these numbers 
(46): 2 — ©) forh = 2,3, ---,29,k =0,1,2,---. 


Suppose now that for n = m + 1, za is the point in F equivalent to z. with 
respect to I, and that 


2. = Pan» 
where P, belongs tol. Evidently 
= forh = Oorl,k =0,1,2,---, 
and 
(47): $n) = forn = m+ 1. 
Put 
= = = for n = m + 2, 
and 
so that 
= 
and therefore in particular 
= forh = 0or1,k = 0,1, 2, ---. 
Since (qx , gx’) = 1 and T, is a left-hand divisor of T? = T,P;', obviously also 
(Qn Qn) = 1 (n = m + 1). 


Furthermore, all numbers 
Ze = Ta 


lie in F, as follows from the construction of z, and T,, . 


| 
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Hence, if 
, ,Qm}, {To, Ti, --+ , Tm}; {Zo 21, 2m} 


are the elements of Q(fo), T(o), and 2({>) with indices n S m, we can apply 
Theorem 2 to the infinite sequences 


2,23, +--+}, {To, Ti, T2,---}, {20 21,22, 


We obtain the existence of a P-adic integer ¢ (for which, by the way, | £ — fol» S 
P~™), such that {Q,} = (¢), = T(O), {zn} = 2(¢). This number is irra- 
tional by Theorem 12, since 


F 
= = (k = 0,1,2,---) 


does not belong to M’’(P). (Formula (27) in §9 shows that ¢ lies in a quadratic 
field; compare the proof of the next theorem.) Also from (45), (46), and (47) 


Yn = VP 
for all n = m, so that ¢ satisfies all conditions of Theorem 13. 


19. Additions to the last theorems 
THEOREM 14: If ¢ is the P-adic integer of Theorem 13, then 


VP—eSy. 8 VP, | Dn + |p = P” 


for an infinity of indices. 
Proor: We specialize the formulae in §9 by taking 


Dn Dn 
T= 1, qm)’ 


k(2g4+1 +1) 
T* = = Pm+k(2o ), 
Ym+k(2g+1)  Ym+k(29+1) 


where these matrices are the same as those defined in the proof of the last theo- 
rem. The equation for g; and g: becomes 

=0, 
and its two roots satisfy 


lele=1, = Po. 


By (27), 
PmP + Pm = 
7 Um g+1) + 


Pm+k(29-+1) — = 
+ 


Here | P"gz |p S 1/P, and | q;, |p = 1 by construction; hence 


PmP~ + Dm 


ImP + dm 


fal | 

a is | 
(4 
| if 
| to 
a T 
| 
| 

| I 
| 
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is a P-adic integer, for which 
QmP + qm 


+k(29+1)} 
29 


i.e. ¢ is the number of Theorem 13. 
All points 2m4429+1 lie on a circle K, whose points z satisfy the inequalities 


since f; is an inner point of K and since 3(fi) < P, K must have an are C, for 
whose points ¥(z) S YP. Now the multiplicator # of Q is not a root of unity, 
if, as we assume, g is sufficiently large. Hence there are arbitrarily large integer 
values of k, for which 2m+429+1) lies on C, so that the theorem follows immediately. 

The number ¢ of the last two theorems had the property that all elements of 


Q(¢) with sufficiently large indices were either equal to + ‘a or belonged 


to M’’(P). This is not accidental, as the following theorem shows: 
THEOREM 15: If an infinity of elements of Q(¢) belong to M'(P), but are different 


jrom = (9, 


4P — 1) 
To every € > 0, there is a number ¢ with this property, for which 
mie 
0 P 
Proor: Let = +( ) betong to M’'(P), so that 
Brit Bait 0O 
and 
(Bn412n — + = —P. 
If | Bry: | 2 2, then, as in §18, 
1) 
| | 2 


If | Bria |= 1, 0, then | — An+1 | 2 4, 
| — | 2 3; since min (| z,|, | 2n41 |) VP, 


— 1)! 


min (Yn Yn+1) 


Finally, if | Bayi | = 1 and an4181.41 = 0, then Q,41 is one of the matrices 


j 


| 
‘ 
| 
| 
4 
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where a is an integer. Of these matrices, + ee “ and + - do not 


belong to M(P). If = + ae 


hence 


), then a & Oand z, = —P/(én41 — a); 


Yn = < 
— a)? + ~ 14+ 


since | n41 — a| = 3, and therefore 
_ (4P — 1) —2(4P _ 4P - 1 ) 

Ya * 2 ~ — 
(4P — 


so that again min (Yn, Ynu1) S A similar proof applies to 2,4; = 


_~{-a l 
The second part of the Theorem is obtained immediately from Theorem 5, by 
0 P 
taking Q = 4; 


20. An application of the preceding results 
For complex z = x + yt, let 


+ ||)? + y) 


and, if ¢ is a P-adic integer, define 
T(¢) = lim inf ¢(z,). 


The results in §§17-19 lead to the following 
THEorEM 16: If ¢ is rational, then 


= @. 
If ¢ ts irrational, then for an infinity of indices 


P+1 
t(zn) 
and therefore 

P+1 
4./P 


Finally, to every « >. 0, there is a P-adic integer ¢(which evidently must be irra- 
tional), for which 


TS) 


P+i1 
4,/P 


| | 
| 
| 
| (x 
al 
| 
| in 
| 
{ sil 
to 
| fo 
fo 
} 
| as 
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Proor: The first part of the theorem follows immediately from the corollary 
to Theorem 11, since z, is bounded. 
Now let ¢ be irrational. The identity 


shows that for all z in F 

and therefore 

4y 
The function (1 + y’)/4y has its minimum at y = 1, and increases with increas- 
ing|y—1|. Hence, by Theorem 12, for an infinity of n 


1+(¥) P+1 
2 


since (P + 1)/4/P increases with P, and 


7 < 3 
8/3 ~ 42° 
The last part of the theorem follows from the proof of Theorem 13, by which, 
to every 6 > 0 there is a P-adic integer ¢ such that 


V-P| 


t(z) 


for an infinity of n, and 


for all other sufficiently large indices. For, if 6 is chosen sufficiently small, then 
for the indices of the first class 


P+1 < 


t(zn) — |= €, 


and for those of the second class 


as was to be proved.— 
For P < 19, the expression (P + 1)/4/P is given in the following table: 
P 2 3 5 7 11 13 17 19 


3 1 3 2 3 7 9 5 
(P+ 1)/4vP 4/2 VB V7 Vil 2/13 V19 


t(zn) max 
| 
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If ¢ satisfies the conditions of Theorem 15, then the upper bound for 7(¢) 
can be improved to 

4P +3 
8(4P — 1)?’ 


but this is probably not the best possible result. 
Frnau Remark: It is clear that if f(z, y) is an arbitrary function, then the 
limits 


TS) 


lim inf f(an, yn), lim sup f(2n, Yn) 
can be investigated in the same way as for the special cases we have considered. 
It would be useful to consider in particular the cases f(z, y) = x/y and f(z, y) = 

2 
Me a ; the formulae (52) in the next paragraph show why these functions are of 


interest. 


Part II: DIiopHANTINE APPROXIMATIONS IN THE FIELD OF THE 
P-Apic NUMBERS 


21. Introduction of a binary quadratic form 


We shall now use the results of the first part to obtain theorems about Diophan- 
tine Approximations in the P-adic field. This problem is connected in a very 
simple way with the properties of the representative z(¢) by means of the theory 
of positive definite binary and ternary quadratic forms. 

As usual, if z = x + yzis an arbitrary complex number, we denote its complex 
conjugate number by a bar: Z = x — yi. Let \ be the same number as in the 
first part, and ®(X, Y) the positive definite binary quadratic form 


2(X — rxY)(X — xY) 


@(X, Y) = 


= AX’ + 2BXY + CY’. 


Obviously its determinant 


(A — 2)? 


and since } lies in F, @ is a reduced form. In particular 


#(X,Y) = Y’, if \ = i, 


+ XY + Y’) 


_ 
v3 


#(X, Y) = 5 


For n = 0, 1, 2, --- put 

(49): ,(X, Y) = &(p,.X + pry, + Y) = A,X? + 2B, XY + C,Y’, 
so that the determinant of ®, is 


(50): — A,C, = —P”’. 
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By formula (8) in §2, 


(51): ©,(X, Y) = — znY)(X — 2,Y), 
and therefore from (50), if 2. = 2n + Ynt, 
n n 2 
Yn Yn Yn 


On the other hand, from (49): 
An = ®(Pn Qn); 

(53): B, = Apspn + + Dadn) + Congr 
C, = , 


By construction, z, lies in F. Hence all forms 


&,(X, Y) (n = 0, 1,2,---) 
are reduced, so that 
(54): -A, $2B,< A, SC,, P™ SAC, A. 


Therefore, in particular: 
THEOREM 17: For any P-adic integer ¢ and for every positive integer n, there are 
two integers p, and q, , such that 


| Pn + Gof |p SP", 0 < Qn) 


22. The P-adic analogue to the Theorem of Lagrange 
The following considerations depend on the identity 


(55): + pr Y, qnX = {X? — + + 


which follows immediately from (52) and (53), and on the inequality 
(56): | XY’ — X’Y ? s Y)@(X’, Y’), 
which is obvious from the identity 
(AC — B’)(XY’ — 
= (X, Y)®(X’, Y’) — {AXX’ + B(XY’ + X’Y) + cYY’}’. 


THEeorEM 18: Let ¢ be a P-adic integer, T, = (* Ps) the element of T(¢) 


n Qn 
with index n, and p, q two integers, such that 
lp+a|lpsP", (p,q) >0. 


i 
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Then 
®(p, q) 2 P(Pn, In), 
with equality if and only if either 
|zn| > 1 and p= npn, = 19n; (n = +1), 


v3 
2 

P= "Pr; q = 9, 


Yn > and P = NPr; 


2 (n 


P= Mr, Y= Mr, OTF P= Pa— Pn), = — Qn); 


en 


Proor: Suppose that 
lp +@|p SP", < &(p,q) S 
Then, from (56), 
| Pan — Dad |” S P(Pn, In) In) S$ P", 
| — | Gn) In) S $ 
while on the other hand 
| — Dad |r 
= |qn(p + a6) — (pn + ang) |p S max (|p + gf |e, | Dn + |p) SP’, 
| Pnd — PQn |p 
= | + anf) — + 95) |p max (| pa + |p, |p + |r) SP. 


Therefore each of the two numbers 


Pn — PnQn 


is equal either to 0, or to +1, or to —1. Now obviously 
so that one of the following four one occur, where 7 = +1: 
a) = = b) p = q = 
P= (Px + Pn), = + d) — Pn), = Gn — In): 


\ 
ia 
| 
| Zn | q= Or 
= 
= +1). 
| 
i 
| h 
c 
| 
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The first case is trivial. In case b), from (52), 


t+ yn) 
nr) = — 
(Pus Qn) 


®(p, q) = gn) = 


so that necessarily | z,| = 1. In case c), from (55), 
n 


which is impossible for all points z, in F. Finally in case d), from (55), 


n 2 2 
P (1 + In + yn) < Gn) 


= — Pay In — In) = 


which requires that z, = pa eh 


23. P-adic analogues to the Theorem of Hurwitz-Borel 


The results in §§12-16, the formulae in the last two paragraphs, and Theorem 
18 give immediately the following theorems, which form improvements on 


Theorem 17: 
TuHEorEM 19: A) To every diadic integer ¢ and for at least one of any three 
consecutive indices n, there are two integers pn, Qn, for which 


+ O< O(pr, qn) S 
B) To every « > 0, there is a diadic integer ¢, such that 
—n 2 
2", 0< (3, -«).2 


has no integer solution p, q for sufficiently large n. 
THEOREM 20: A) To every triadic integer ¢ and for at least one of any three 


consecutive indices n, there are two integers Pn, Gn, for which 


1 


B) To every « > 0, there is a triadic integer ¢, such that 


1 
Ip <3”, 0< 


has no integer solution p, q for sufficiently large n. 
THEOREM 21: A) To every pentadic integer ¢ and for at least one of any two 
consecutive indices n, there are two integers Pn, Qn, for which 


| 
) | | 
| 
| 
i 
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B) 1'0 every « > 0, there is a pentadic integer ¢, such that 
0 < (p,q) S (1 — 


has no integer solution p, q for sufficiently large n. 
THEOREM 22: If P = 1 (mod 6), then to every « > 0 there is a P-adic integer ¢, 
such that 


lp+q@|psP", O0< (2, - 


has no integer solution p, q for sufficiently large n. 
Probably for any prime number P and for any P-adic integer ¢ 


lim inf 

®(p,q)>0 


where Y(P) is the arithmetical function defined in §11; but so far I could prove 
this only for P = 7. That 1/Y(P) cannot, in general, be replaced by any 
smaller number, follows from Theorem 7. 


24. The P-adic analogue to the Theorem of Khintchine 


The results in §§18-19, the formulae in §21, and Theorem 18 lead immediately 
to the following results: 
THEOREM 23: A) To every irrational P-adic integer ¢ there is an infinity of 
indices n for which the conditions 
0< (p,q) < 
have no integer solution p, q. 


B) To every « > 0, there is an irrational P-adic integer £, such that for all 
sufficiently large n there are two integers pn, Yn, for which 


lpn + ont lp SP, 0 < ®(pn, dn) S (Jp 


VP 
while for an infinity of indices 


THEOREM 24: A) If an infinity of elements of Q(¢) belong to M'(P), but are 
different from + tid 4 , then there is an infinity of indices n for which the in- 
equalities 


2P” 


=P", 0 < 2,9) < 


have no integer solution p, g.— 
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B) To every « > 0, there is a P-adic integer ¢ with the last property, such that 
for all sufficiently large n there are two integers pn , qn , for which 


25. The P-adic analogue to the Theorem of Tchebycheff 


Ch. Hermite proved the following theorem: 
THEOREM 25: Let 


o(X, Y) = AX’ + 2BXY + CY’ 


be a reduced positive definite quadratic form, Xo and Yo two real numbers. Then 
there are two integers X and Y, such that 


AC(A — 2|B| +0) 


— X,Y —YVo) 4(AC — B) | 


By means of this lemma and the results in §20, we shall prove: : 
THEOREM 26: Let ¢ and # be two P-adic integers, of which ¢ is irrational. : 
Then there is an infinity of indices n, for which there are two integers un and v, , : 
such that i 


—n P+1 an 


Proor: For every index n let E, be the integer defined by 
SP", 0s P" -1. 
By Theorem 25, there are two integers X, and Y,, such that 


4(A,C, — Bi) 


Xn + Dn Y, E,, Qn Xn + In 


since 
®(p,X + prY, + = ©,(X, Y) = + 2B,XY + 
is a reduced positive definite form. Obviously, from (52), 


AnCu(An — + Cs) _ ps (en + ya) (1 = | + Ys) 
4(A,C,, 4yn 
hence, by Theorem 16, for an infinite sequence of indices n 


P+1 yn 
+ — E,, QnXn + Gn Yn) 


10 Hermite, |. c. 6, 94-99. 


~ 
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Now put 
= + Wn = Vales + Un = P"Wn = Ann 
where A,, are defined as in §§1-2. Since 


Pn = — Pn = — Ann 


obviously 
Un = + rnYn) An(QnXn + x) E, DnXn + E,, 


and therefore 
P+1 


On the other hand 
Un + Anvn + E, = , i.e. |un + fn SP", 
so that wu, and v, have the required properties.— 


If Q(¢) contains an infinity of elements of M’(P) different from #(9, a" 


then the constant (P + 1)/4./P in the last theorem can be replaced by the 
smaller number (4P + 3)/4(4P — 1)'. It is nearly certain that both these 
constants are not the best possible ones; so far, however, I have not been able 
to obtain the true values. 


UNIVERSITY OF MANCHESTER, 
MANCHESTER, ENGLAND. 
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TABLE OF REDUCED POSITIVE QUATERNARY QUADRATIC FORMS 
By 8. B. Townes 
(Received March 7, 1939) 


Charve’ published a table of reduced positive quaternary quadratic forms, 
which, however, contains several pairs of equivalent forms. In 1936 the author 
found conditions for unique reduced forms according to the Eisenstein method.” 
The forms in the following table obey these conditions. 

Forms in different genera are separated by semicolons, those in different 
orders by colons, properly and improperly primitive forms by periods. The 
symbol (abcdrstlmn) represents the form of matrix 


t brm). 
lL mnd 


a sequence of equal coefficients is indicated by a subscript. The table contains 
all reduced forms of determinant D < 25, with the exception of the only im- 
primitive form (2 2 2 2) of determinant 16. 


D 

1 

2 (1112). 

3 (1118) ;(11220,1). 

4 (1114);(122210;1.) ;(1122). (22221,;0). 

5 (1115);(11230;,1). (2222 1,). 

6 (1116), (122210;) ; (1123). 

7 (1117), (1124051) ; (12231 0; 1,). 

8 (1118) ;(1133051) ; (1223 04 12) ; (2223 1,0) : (1124) :(122 2). 
(2222 150s). 

9 (1119), (2223 1) ; (1125051): (1133) ; (122310). 
(2 2220. 1 Oe 1). 

10 (11110), (1223010) ; (1125), (122410; 12). 

11 (11111), (11340; 1); (11260, 1), (2223 02 1,). 

12 (11112) , (22231303) ; (1134) , (1224012) ; (12331051) ; 


(1224105) : (1126) ; (1223) : (1144052) ; (222305 1s). 
(222202103) ; (22241; 0). 

(11118), (12251031) ; (1127051), (1233101). 
(2224 1g). 


1Comptes Rendus (Paris), vol. 96 (1883), p. 773. 
* Doctor’s dissertation, Chicago, 1936 (unpublished). 
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18 


19 


21 


24 


25 


(11114), (1127), (1224010) ; (11350; 1) , (2223 0 1;0). 
(11115), (1144051) , (2223 02.1021) ; (1233 10s); 
(1225105) , (2233 1,0) ; (1135), (1128051). 

(111 16), (122504 1s) ; (1226105 12) ; (11440; 2), 
(2225150) ; (2233 Ie) : (1128), (22230; 120) ; (1233051) : 
(1144), (2233 12013) : (1224) : (1333 —1 0; 1s). 

(2 22 4 15 0s). 
(11117),(1129051) , (222515) ; (1136051) , (123410; 1,). 
(2224 Os 14). 

(11118), (12250,10) , (2233 13021) ; (1129), 
(123410,1) : (1136), (2223 0.105) ; (1233), (122610). 
(11119), (1145051), (12340, ; (112100; 1), 
(1227103 12) , (2233120101). 

(11120), (1145) ;(123410s) , (2233 1.03 1) ; (12260, , 
(22251303) ;(11370;1), (124410312) :(11210), 
(222303102) ; (1225), (11460; 2) : (133310351). 
(22240. 130) : (22240; 13) ; (22261;0). 

(11121) , (22331303) ; (1137), (1234010), 
(223301021) ;(112110;1), (133310310) ; (11550; 2), 
(1227105) . (224021021) ; (2226 

(11122) ,(12260,10), (1228103 1.) , (12351035 1s) ; 
(11211), (12340; 1) , (2234 1,0). 

(111 23), 

(222502 1s) ;(123510,1), (223416). 
(11124) , (22330.10.1) ;(133310,) , (2234 1201s) ; 
(1228105) , (22271;0) ;(1138), (1334 —1 0s 1) ; 
(12270412) , (2333 —1 -11012) ; (1155051) ; (11470; 2), 
(12441053 1:) ; (2233 1.0,) : (11212), (1234) ; (1146), 
(2233010101): (1226) ; (2223), (12440;2). 
(222402103) ; (222403 1,0) : (2226 130s) . 

(111 25) , (123504 1s) , (2227 Ie) ; (112130; 1), 
(1229105 1s) , (2234 13021) : (1155), (223301010); 
(1235105). (2244 —1s14). 


THE UNIVERSITY OF OKLAHOMA, 
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NOTE ON LINEAR FRACTIONAL SUBSTITUTIONS WITH LARGE 
DETERMINANT 


By H. Davenport 
(Received December 30, 1938) 


1 


In the preceding paper, Dr. Mahler has defined an arithmetical function Y (P) 
for any prime P. Let a, B, a’, B’ be any integers satisfying af’ — a’B = P, 
such that one of the fixed points f of the substitution 


(1) Z= 

lies in the fundamental domain of the modular group. Then Y(P) is defined by 
¥(P) = min 

for all such substitutions. The object of this note is to prove that 

(2) Y(P) = + 


as P— ©,foranye>0. More generally, I prove the following 

THEOREM. Let z be any complex number other than 0. If P is a large positive 
integer, there exists a substitution (1) of determinant P such that one of its fixed 
points f satisfies 


(3) f = 2+ 


Moreover, the signs of R(f) — R(zo) and of | f| — | 2| may be prescribed ar- 
bitrarily. 

If we take 2 = 4 + 34/37 in this theorem, and prescribe R(f) — R(zo) to be 
negative, and | f | — | 20 | to be positive, then f lies in the fundamental domain 
of the modular group, and (3) implies (2). 

The proof of the theorem is an exercise in the use of exponential sums. 


Lemma. Let N be a large positive integer, and let P be an integer relatively prime 
toN. Forany N, , Ne there exist integers u, v satisfying 


(4) uv = P(mod N), 


(5) Ni <u< + No <v < Nz + 
59 
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Proor. For brevity, we write e(t) = e"'’*. The expression 


1 


N z=0 Ny (5/6) +e 


(5/6)+e 


| has the value 1 if or a number congruent to u satisfies Ny < < Ni + 
| and 0 otherwise. Hence the number of solutions of the conditions (4), (5) is 


N—I N-1 N-1 


D e(ai(u — yx) + — y2)), 


N)=1 


a. where Pu’ denotes any integer v satisfying w = P(mod N), and y, ye are 
summed over integers satisfying 


<y < Ni + 


Ne < ye < Ne + NOM, 


The contribution of the terms with 2; = x2 = Ois 


~ 


Let S(a, b) denote the Kloosterman sum 


N-1 


S(a, b) = > e(au + bu). 


The contribution of the terms with 2; = 0, x2 ¥ Ois 


= 1) 80, Pad 


and the contribution )_; of the terms with 2 0, x. = 0 has a similar form. 
The contribution of the terms with 2; ¥ 0, 2. ¥ 0 is 


N-1 N-1 


It is known’ that 


S(a, b) = O(N*(a, N)), 
where on the right a may be replaced by b. Also, if 1 < « < N —1, 


e(—zy) = O (*) 


‘ 1Salié, Math. Zeitschrift, 36 (1932), pp. 263-278. For the case of a prime N, from which 
the general case follows trivially, see also Davenport, Journal fiir Math., 169 (1933), pp. 
158-176. 
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where z’ = min(z, N — 2), and y is summed over any interval. Hence 
N-1 
zq=1 


=o(n** > 4) 


m<Nid md 


where d(N) denotes the number of divisors of N. Similarly, > is of the same 
order of magnitude. Finally, f 


= o( log n> (21, N) 


= O(N** log N d(N)). 


The upper bounds for >>; , >>; , >.2 are all of lower order of magnitude than 
o(N)N****, since 
N 


for any « > 0. Hence the number of solutions of (4), (5) is positive. 


3. Proof of the Theorem 


Suppose first that both the prescribed signs are positive. Let r = R(z), 
m=|z |. Let k be any fixed number greater than m. Let 


(6) p= —rt+(? —m+h), 


where the radicals are taken positively. Clearly \ > u > 0. 

Choose 8 to be one of the two integers next less than (P/k)', and to be rela- 
tively prime to P (at least one of any two consecutive integers is prime to P). 
We have 


(7) P>ks, P= ks + 0(8). 
By the Lemma, there exist integers a, 8’ such that 
(8) \B — pelt <a up — <p’ < 
” ap’ = P(mod 8). 


= O(N*** log N d(N)), 
g ¢(N) | 
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By (9), there exists an integer a’ such that 
(10) ap’ — a’'B = P. 
By (8) and (6), 


a — > (A — = 2r8, 
a — = + 
By (7), (8), (10), and (6), 


of’ < dup’, = + itb 
P — af!’ > (k — = mp’, 
ap’ mp” + Di 
Hence 
a — —(1/6) +e —(1/12)+e 

(11) 0< —r=0(6 ) =o ), 
26 ge 
—a’ —(1/6)+¢ —(1/12)-+e Be 
ell 

Since the fixed points of the substitution (1) are the roots of 
Bf — (a — — a’ = 0, 
fri 
we have ein 
Rf 28 lf | B we 
and the theorem follows from (11) and (12). ae 
If the prescribed signs are — for R(f) — R(z) and + for|f| — | 20], it is 
only necessary to replace the intervals (8) by ne 
MB — 28% < < ng — uB — < Bl < up. ge 
If the prescribed sign for | f | — | zo | is negative, we choose 8 to be one of the ‘ : 
two integers next greater than (P/k)', and take either the intervals ’ 
\B or <a< + < B’ < up Ni 
or the intervals “i 
Pe 
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BEWEIS EINER CARATHEODORYSCHEN VERMUTUNG. TEIL I 
Von Hans HAMBURGER 
(Received February 4, 1939) 


1. Herr Caratheodory hat gelegentlich die Vermutung gedussert, dass eine 
iiberall regulire geschlossene Flaiche vom Geschlechte Null (mit stetiger Tan- 
gentenebene und stetiger Kriimmung) mindestens zwei Nabelpunkte besitzt. 
Der Beweis dieser Vermutung wird unter gewissen weiter unten angegebenen 
Differenzierbarkeitsvoraussetzungen, die fiir analytische Flachen sicher alle 
erfiillt sind, in der vorliegenden Arbeit und einem spater zu veréffentlichenden 
2. Teile gefiihrt werden. 

Die Caratheodorysche Behauptung, die eine Aussage iiber die Differential- 
geometrie im Grossen enthalt, lasst sich nun aber in einfacher Weise auf eine 
Behauptung iiber das Verhalten der Kriimmungslinien in der Umgebung eines 
isolierten Nabels reduzieren, wenn man den Begriff des topologischen Index 
eines singularen Punktes im Kurvennetz heranzieht. 


2. Die topologischen Eigenschaften eines Kurvennetzes sind vom Verf. schon 
friiher an anderer Stelle’ untersucht worden. Ist auf einem ebenen Bereich oder 
einem Flachenstiick ein Kurvennetz, das sind zwei sich schneidende Kurven- 
scharen, ausgebreitet, so heisst ein Punkt P ein regulirer Punkt des Netzes, 
wenn die Netzkurven, die eine hinreichend kleine Umgebung des Punktes er- 
fiillen, als Bilder zweier sich senkrecht schneidender Scharen von zu einander 
parallelen Geradenstiicken aufgefasst werden kénnen. Ist jetzt S ein isolierter 
singularer innerer Punkt des Netzes, d.h. ist jeder Punkt einer geeignet ge- 
wahlten Umgebung von S ein regulirer Netzpunkt, so lasst sich S durch eine 
geschlossene Jordankurve p mit zwei Eigenschaften umgeben:’ 

erstens: » begrenzt einen einfach zusammenhidngenden Bereich, der ausser S nur 
regulére Punkte des Netzes enthalt; 

zweitens: p setzt sich in seiner Gesamtheit aus endlichen vielen Stiicken von 
Netzbogen zusammen. 

Die Punkte von p, in denen Stiicke verschiedener Netzkurven zusammen- 
stossen, d.h. in denen eine zu p gehérige Kurve der ersten Schar sich mit 
derjenigen Kurve der zweiten Schar schneidet, auf welcher die angrenzende 
Polygonseite gelegen ist, sollen Ecken des Polygonzuges heissen, und zwar 
unterscheide man einspringende von ausspringenden Ecken, je nachdem die 


‘H. Hamburger, ‘“‘Uber Kurvennetze mit isolierten Singularitaten auf geschlossenen 
Flachen.” Math. Zeitschr. 19 (1924), 8. 50-66. Vgl. auch die dort zitierten Arbeiten von 
Henri Poincaré, Fussnote 8; S. 53 und Ivar Bendixson Fussnote 10, 8. 54. 

* H. Hamburger, loc. cit. Fussnote 1; 8. 59. Abschnitt 13. 
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Fortsetzung der beiden in der Ecke zusammenstossenden Netzbogen tiber die 
Ecke hinaus sich beide gleichzeitig ins Innere von p hinein fortsetzen lassen 
oder nicht. Bezeichnet i die Differenz der Anzahl aller ausspringenden Ecken 
von » vermindert um die Anzahl aller einspringenden Ecken von )p, so zeigt 
sich, dass, obgleich sich ein Polygonzug p mit den geforderten zwei Eigenschaften 
in unendlich mannigfacher Weise konstruieren lasst, alle diese Polygonziige 
auf die gleiche Zahl 7 fihren. Mithin ist 7 eine fiir die topologische Singularitat S$ 
invariante charakteristische Zahl, die nunmehr S als Index zugeordnet werde.’ 


3. Erfiillt jetzt das Kurvennetz vollstandig eine geschlossene zweiseitige 
Flache vom Geschlechte p, wobei alle Flachenpunkte regulaire Netzpunkte sind, 
ausgenommen k singulire Punkte S, (x = 1, 2, .--, k), mit zugehérigen Indices 
i, , 80 gilt die fundamentale Formel 


k 
U 94 
(1) z ( = ) = 2p — 2. 


4. Man denke sich jetzt auf einer geschlossenen zweiseitigen Flaiche vom 
Geschlecht p mit stetiger Tangentenebene und stetiger Kriimmung das System 
der Kriimmungslinien konstruiert ; wir erhalten so ein topologisches Kurvennetz, 
dessen einzige topologischen Singularitaten offenbar die Nabel sind. 

Aus Formel (1) folgen jetzt unmittelbar fiir geschlossene zweiseitige Flachen 
vom Geschlechte Null die beiden Tatsachen: 

I. Es existiert keine eschlossene Flache vom Geschlechte 0 ganz ohne Nabel.’ 

Denn bei einer solchen Flache wiirde die Summe auf der linken Seite von (1) 
gleich null werden, wahrend die rechte Seite von (1) gleich —2 ist. 

II. Wenn eine geschlossene Fliche vom Geschlechte 0 mit einem einzigen Nabel 
existieren wiirde so miisste der Index i dieses Nabels gleich —4 sein; denn fiir eine 
solche Flache folgt aus (1) 


; 
—2,d.h. —4. 


Die Caratheodorysche Vermutung ersetzen wir nunmehr durch die viel weiter- 
gehende Behauptung: 


(2) i20°; 


3 Uber andere Arten, den Index der isolierten Singularitaét eines Kurvennetzes oder 
einer einparametrigen Kurvenschar einzufiihren, insbesondere iiber die Definitionen von 
H. Poincaré, I. Bendixson und Hellmuth Kneser, vgl. den Bericht der in Fussnote 1; 
angegebenen Arbeit des Verf. 8S. 53-57. Vgl. auch dort die zahlreichen Fussnoten, die 
genaure Literaturangaben enthalten. 

‘ Ein einfacher Beweis dieser Formel unter sehr allgemeinen Voraussetzungen findet 
sich 1. c. Fussnote 1; 8. 58-62. 

5 Ein direkter Beweis (ohne Heranziehung von Formel (1)) des Satzes, dass kein singu- 
laritétenfreies Kurvennetz existiert, welches eine Flache vom Geschlechte 0 vollstandig 
tiberdeckt, findet sich in der Arbeit des Verf.: ‘Ein Satz tiber Kurvennetze auf geschlos- 
senen Flachen.’”’ Sitzber. d. Preuss. Akad. d. Wiss. zu Berlin 21 (1922), S. 258-262. 

° Es sei hier eine einfache und anschauliche Definition des Index einer Singularitat 
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denn mit Ricksicht auf die eben formulierte Tatsache II folgt die Caratheo- 
dorysche Vermutung unmittelbar aus (2). 


5. Um eine méglichst einfache Form fiir die Differentialgleichung der Kriim- 
mungslinien zu erhalten, gehen wir von einer Darstellung der vorgelegten Flache 
§ aus, die wir in Kap. II §5 der vorliegenden Arbeit ableiten werden. Bezeich- 
nen 2, y, z die drei rechtwinkligen kartesischen Koordinaten eines Punktes auf §, 


so setze man 


We 

We 

2W 

1+Wit+ We 


Hierbei sind u, v speziell gewahlte Gaussche Flachenparameter, W ist eine 
dreimal stetig differentiierbare Funktion von u und v; W,, We, Wau usw. 
bezeichnen in iiblicher Weise die partiellen Ableitungen von W(u, v). Durch 
die Funktion W(u, v) ist somit die Fliche § eindeutig bestimmt. 

Eine geometrische Deutung dieser Flachendarstellung erhalt man, wenn man 
die Abbildung betrachtet, welche den durch die Formeln (3) bestimmten 
Flachenpunkt in den Punkt der Ebene mit den rechtwinkligen Koordinaten 
u, v tiber fiihrt;’ dann existiert eine Kugel vom Radius W und dem Mittelpunkt 
r= u,y =v,2 = W, welche gleichzeitig § im Punkte (3) und die Ebene z = 0 
im Punkte z = u, y = v berihrt, oder anders, noch anschaulicher ausgedriickt: 
Spiegeln wir das Normalensystem von § an den Tangentialebenen der Flache 
mit den Koordinaten z = u, y = v, 2 = W, so wird dieses Normalsystem in 
eine Schar von zur z-Achse parallelen Geraden iibergefiihrt, insbesondere wird 


fiir den Fall erwahnt, dass das Kurvennetz, wie z. B. bei den Kriimmungslinien durch cine 
Differentialgleichung 1. Ordnung definiert ist: 

Bezeichnet w den Zuwachs des Winkels einer Integral-kurve, den man bei cinem 
geschlossenen Umlauf um den Nabel erhalt, und setzt man w = jz, so besteht zwischen j 
und i der Zusammenhang j = 2 — 1/2. : 

Vgl. z. B. H. Hopf und H. Samelson: ‘‘Zum Beweise des Kongruenzsatzes fiir Eiflachen.”’ 
Math. Zeitschr. 48 (1938), S. 749-766. Siehe insbesondere 8S. 750. 

Der Behauptung (2) i = 0 entspricht dann j S 2. 

Endlich sei noch auf einen Satz von Herrn Schilt tiber Flachpunkte hingewiesen, in die 
man ja jeden Nabel durch eine geeignete Inversion (Transformation durch reziproke 
Radien) tberfiihren kann: Besitzt jeder Flichenpunkt in der Umgebung eines isolierten 
Flachpunktes negative Kriimmung, so ist immer j < 0, d.h. somit i 2 2. Vgl. H. Schilt, 
“Uber die isolierten Nullstellen der Flachenkriimmung und einige Verbiegbarkeitssitze.” 
Compos. Math. 6 (1938), S. 239. Siehe insbesondere Nr. 18. 

Vgl. auch die oben zitierte Arbeit von H. Hopf und H. Samelson 8. 756. 

’ Eine ausfiihrliche Ableitung der Darstellung (3) vermittels Ribancourtransformation 
findet sich in der Arbeit des Verf. ‘“Ribancourtransformationen und spharische Abbildung.”’ 
Acta Math. 62 (1933), 8. 1-90. Vgl. insbesondere S. 44~46. 
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ung (2) in Angriff nehmen, genau zu formulieren, substituieren wir in der 
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hierbei die Flichen-normale im Punkte (3) in eine Gerade verwandelt, die auf 
der Ebene z = 0 im Punkte x = u, y = v senkrecht steht. (vgl. Fig. 4. und 
die ausfiihrliche Ableitung in Kap. II §5). 

6. Wie in Kap. II §6 gezeigt wird, ist die Differentialgleichung der Kriim- 
mungslinien der durch die Formeln (3) dargestellten Flache 


(4) Wav (du? — dv?) + (Wo — Wau) du dv = 0. 


Diese Differentialgleichung hat vor allem die einfache Eigenschaft, dass, wenn 
man ihre Integralkurven in der u, v Ebene deutet, diese Kurven ein ebenes 
orthogonales Netz bilden. 

Indem man die Differentialgleichung (4) oder vielmehr eine aus ihr durch 
Kinfiihrung von Polarkoordinaten abgeleitete Differentialgleichung (vgl. Formel 
(8)) zugrundelegt, lisst sich der Index 7 einer Singularitét von (4) nach einer 
einfachen allgemeinen Methode bestimmen, welche in Kapitel I auseinander- 
gesetzt ist. 


7. Umendlich die Voraussetzungen, unter denen wir den Beweis der Behaupt- 


Funktion W (u, v) die Polarkoordinaten 


u = pcos 8, v = psin 0. 


Wir fordern nunmehr, dass die vorgelegte Flaiche § durch den Nullpunkt geht, 
dort einen Nabel hat und die x, y Ebene als Tangentialebene besitzt. Analytisch 
driicken wir diese Tatsache dadurch aus, dass wir verlangen, W sei in der 
Umgebung des Nabels von der Form 


(5) W(p, 0) = + p™(wo(d) + F(p, 6)). 


Hierbei seien wo(6) und F(p, 6) fir 0 p S$ 6, — + © dreimal stetig 
differentiierbare Funktionen, die in 6 offenbar die Periode 2x haben. Ausserdem 
sei 


lim F'(p, 6) — 0, 


p—0 


wihrend wo(@) nicht identisch verschwinden mége; endlich bezeichne m eine 
positive Zahl > 2, R eine positive Konstante, welche gleich dem Radius der die 
Flache § im Nabel beriihrenden Schmiegungskugel ist, wihrend m — 1 gleich 
der Ordnung der Beriihrung zwischen § und der Schmiegungskugel ist.* 


§ Der Verlauf der Kriimmungslinien in der Umgebung eines Nabelpunktes ist schon 
mehrfach, u.a. von Cayley, Darboux und E. Picard untersucht worden. Alle diese 
Autoren behandeln aber nur den Fall m = 3 unter einer Reihe weiterer einschrankender 
Voraussetzungen. Erschépfend ist der Fall m = 3 erst in einer langeren Abhandlung 
von A. Gullstrand, “Zur Kenntnis der Kreispunkte,’’ Acta Math. 29 (1905), S. 59-100 
dargestellt worden. Ebenda ist auch der Fall m = 4, allerdings unter einschriinkenden 
Voraussetzungen in Angriff genommen; fiir allgemeines m beschrinkt sich Gullstrand nur 
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Im Teil I Kap. III wird nun, gestiitzt auf die in Kap. I zur Bestimmung des 
Index i entwickelte Methode, der Beweis der Behauptung 7 = 0 gefiihrt unter 
der Voraussetzung, dass 

I. die Funktion dwo(@)/dé nur isolierte Nullstellen hat, 

Il. die Funktion wo(0) nur Nullstellen erster und zweiter Ordnung, aber keine 
Nullstellen dritter und héherer Ordnung besitzt. 

Es gelingt hier, die Abschaétzung 7 2 0 sogar in einer noch schirferen Form 
zu beweisen, nimlich: Unter den Voraussetzungen I und II lisst sich um den 
Nabel immer ein aus Stiicken von Kriimmungslinien zusammengesetzter geschlos- 
sener Polygonzug » konstruieren, welcher tiberhaupt keine einspringenden Ecken 
enthalt; ausgenommen ist hierbei der Fall, dass die eine Schar von Kriimmungs- 
linien sich spiralférmig um den Nabel als asymptotischem Punkt windet, dann 
lisst sich der geschlossene Polygonzug » so konstruieren, dass er genau eine 
einspringende und eine ausspringende Ecke enthilt. 

Im Teil II wird auf die Voraussetzung II verzichtet, d.h. es werden Funk- 
tionen wo(@) zugelassen, welche im Intervall 0 < @ S 27 endlich viele Null- 
stellen beliebig hoher Ordnung besitzen. Unter einigen weiteren Voraus- 
setzungen, die gewiss alle erfiillt sind, wenn nur die Flache im Nabelpunkt und 
seiner Umgebung analytisch ist, wird dort die Behauptung 7 = 0 bewiesen 
werden. Hierzu sind wesentlich tiefer liegende Untersuchungen erforderlich 
als diejenigen, welche wir hier in Teil I durchzufiihren haben.’) 


KapireL I. ENTWICKLUNG EINER ALLGEMEINEN METHODE, UM DEN INDEX 
DER ISOLIERTEN SINGULARITAT EINER DIFFERENTIALGLEICHUNG ERSTER 
ORDNUNG ZU BESTIMMEN, DEREN INTEGRALKURVEN EIN EBENES 
ORTHOGONALFS KURVENSYSTEM BILDEN 


§1. Einfiihrung von Polarkoordinaten. Voraussetzungen iiber die Koeffizienten 


der Differentialgleichung 
8. In der u, v Ebene sei eine Differentialgleichung 
(6) X(du? — dv’) + 2Y dudv = 0 


vorgelegt, deren Koeffizienten X, Y in der Umgebung des Nullpunktes der u, v 
Ebene definiert sind und dort einer Lipschitz-Bedingung geniigen. X und Y 
mégen ausserdem fir u = 0, v = 0 gleichzeitig verschwinden, wahrend sie in 
einer gewissen Umgebung des Nullpunktes der u, v Ebene keine gemeinsame 
Nullstelle ausser dem Nullpunkt haben. 


auf wenige Bemerkungen. Genauere Literaturangaben finden sich bei Darboux, Legons 
sur la théorie générale des surfaces IV (1896), S. 448, und in der zitierten Arbeit von Gull- 
strand S. 59 und 60. Uber die Bedeutung der in der Darstellung (5) enthaltenen Vor- 
aussetzungen iiber die Fliche § verg. Fussnote 16 §7. 

® Die Uberlegungen und Beweise des I. Teiles hat Verf. bereits im Jahre 1923 gefunden 
und dariiber in der Berliner Mathemat. Gesellschaft in der 210. Sitzung am 26. 3. 1924 
vorgetragen. Die Untersuchungen zum II. Teil hat Verf. Okt. 1937 begonnen und Sept. 
1938 abgeschlossen. 
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Dann bilden die Integralkurven der Differentialgleichung (6) offenbar in der 
Umgebung des Nullpunktes der u, » Ebene ein orthogonales Kurvensystem mit 


dem Nullpunkte als einzige Singularitat. 


9. Wir fiihren Polarkoordinaten ein 


(7) u = pcos 8, v = psin @. 


Dann wird 
du? — dv’ = cos 20-(dp’ — dé’) — 2 sin 26-p dp dé. 
2 du dv = sin 20-(dp’ — p' dé’) + 2 cos 20-p dp dé, 


und fiir (6) erhalt man 
(8) A(dp’ — p’ d0’) + 2Bp dp dé = 0, 


wobei 


(9) 


A = X cos 26 + Y sin 26, 
B= — X sin 20+ Y cos 20 


gesetzt ist. 


10. Aus den Voraussetzungen fiir X und Y folgt fir A und B: 
1. A und B verschwinden beide fiir p = 0; 

2. in einer gewissen einseitigen Umgebung Ui der Geraden p = 0 in der p- 0-Ebene 
(p > 0) konnen A und B nicht gleichzeitig verschwinden; 

3. A und B haben beide in 0 die Periode 27; 

4. A und B geniigen in U einer Lipschite-Bedingung. 

Aus der Voraussetzung 4. folgt unmittelbar, dass fiir p > 0 durch jeden 
Punkt von U zwei und nur zwei Integralkurven laufen, welche der Differential- 
gleichung (8) geniigen. Weiter setzen wir fiir A und B voraus: 

5. Die Umgebung Ul der Geraden p = 0 lasse sich so klein wahlen, dass der 
Koeffizient A dort nur auf isolierten stetigen Kurven c, verschwindet, welche sémt- 
lich einen bestimmten Punkt 0 = 06, der Geraden p = 0 mit einem Punkte des 
Randes von \ verbinden, derart dass zwei beliebige Kurven c, und ¢, héchstens auf 
der Geraden p = 0, aber nicht im Innern von UU, einen gemeinsamen Schnittpunkt 
haben. Endlich sei N die Anzahl der Kurven ¢, (v = 1, 2,--- ,N), deren An- 
fangspunkte 6 = 06, auf der Geraden p = 0 sétimtlich im Periodenstreifen 0 S 
AS&S--- S O41 S +--+ S Ov < gelegen sind. 


§2. Geometrische Betrachtungen iiber die Integralkurven in der Nahe der 
Kurven ¢, 


11. Die Auflésung der quadratischen Gleichung (8) ergibt 


d 
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In einer hinreichenden Nahe der in Voraussetzung 5. definierten Kurven ¢, 
liisst sich, solange A/B < 1, die Quadratwurzel in eine konvergente Potenz- 
reihe entwickeln, und man erhalt 


Hieraus folgt, dass fiir das Vorzeichen, wenn A hinreichend klein bleibt, das 
erste Glied der Entwicklung den Ausschlag gibt: 


(12) se (4) = 04, (%) = = 


12. Nunmehr unterscheiden wir drei Arten von Kurven: Wir sagen, ¢, ist von 
erster Art (kurz eine Kurve c,), wenn A/B beim Uberschreiten von c, mit wach- 
sendem @ von negativen zu positiven Werten iibergeht, 

¢, is von zweiter Art (kurz eine Kurve c,), wenn A/B beim Uberschreiten von 
c, mit wachsendem @ von positiven zu negativen Werten iibergeht, 

¢, ist von dritter Art (kurz eine Kurve c,’), wenn A/B auf beiden Seiten von ¢,” 
das gleiche Vorzeichen hat. 

Wegen (12) hat offenbar diejenige Integralkurve von (8), deren Tangente 
gleich (dp/dé@), ist, in ihrem Schnittpunkt mit c¢, 

ein Minimum, wenn ¢, erster Art ist, 

ein Maximum, wenn ¢, zweiter Art ist, 

einen Wendepunkt, wenn c, dritter Art ist. 


13. Endlich zeigt die Differentialgleichung (8) unmittelbar, dass eine Integral- 
kurve, fir p > 0, ganz gleich ob ihre Tangente (dp/d@), oder (dp/d6)s ist, das 
Vorzeichen ihrer Tangente nur in ihren Schnittpunkten mit den Kurven rod 
oder ¢, wechseln kann, da (dp/dé), und (dp/dé@)2 nur in solehen Punkten 0 baw. 
werden, wo A = O ist. 


§3. Beschreibung der Methode zur Bestimmung des Index 


14. Von den in Voraussetzung 5. §1 beschriebenen N Kurven ¢, lassen wir 
jetzt alle Kurven 3. Art. fort. Dann mégen noch n Kurven ¢, (vy = 1, 2 --- ») 
iibrig bleiben, die simtlich 1. Art oder 2. Art sind, und die wir uns so geordnet 
denken, dass die Kurve ¢,4: ganz rechts von ¢, gelegen ist. Ausserdem erhilt 
man wegen der Periodizitatseigenschaft von A (Voraussetzung 3. §1) die Kurve 
Cny1, indem man ¢ parallel zu sich in der Richtung der 6-Achse um 27 verschiebt. 

Wir betrachten nun den Bereich %, , der von ¢,, ¢,41 dem Stiick des Randes 
von U zwischen ¢, und ¢,,; und dem Stiick 6, S @ S 6,4: (vgl. Voraussetzung 5. 
§1) der Geraden p = 0 begrenzt wird. Falls 6, = 6,4: zieht sich dieses Geraden- 
stiick auf einen Punkt zusammen. Dann kann die Tangente einer beliebigen 
Integralkurve nach der Bemerkung aus Abschnitt 13. ihr Vorzeichen im Innern 
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von %, nirgends wechseln, da zwischen c, und ¢,41 keine weitere Kurve 1. Art 
oder 2. Art gelegen ist. 

Wir wollen jetzt in %, den in der Einleitung Abschnitt 2 beschriebenen Poly- 
gonzug » konstruieren, der sich aus Integralkurvenstiicken zusammensetzt und 
dann zur Bestimmung des Index 7 fiihrt, wobei wir drei Fille zu unterscheiden 
haben. 


15. 1. Fauy. c, und ¢,4, sind beide von 1. Art; wir nennen dann auch den 
zugehérigen Bereich B, von 1. Art. Wir betrachten das Netz der Integralkurven 
von (8) in %,. Dieses Kurvennetz kann man in zwei Scharen aufteilen, und 
zwar rechnen wir eine Integralkurve zur Schar I oder II, je nachdem ihre 
Tangente dp/d@ = 0 oder < O ist. Die Grenzfille dp/d@ = 0 baw. © kénnen 
auch eintreten, da ja 8, noch Kurven c’” im Innern enthalten kann. Offenbar 
kénnen sich zwei Integralkurven der gleichen Schar wegen Voraussetzung 4. 
aus §1 in %, nirgends schneiden. 


Wir betrachten nunmehr ein Integralkurvenstiick f,, das c, schneidet und 
dessen Tangente in der Umgebung von ¢, gleich (dp/d@); wird (vgl. Formel (10)). 
Dann gehért dieses Kurvenstiick zur Schar I, da wegen (12) rechts von 
c, die Tangente (dp/dé), > O wird. Ebenso erkennen wir, dass ein Integralkur- 
venstiick f,,; das mit der Tangente (dp/d@), schneidet, zur Schar II gehort, 
da wegen (12) links von c+: die Tangente (dp/dé),; < 0 wird. 

Fine leichte Uberlegung zeigt uns,” dass sich jetzt ein Polygonzug konstruieren 
lasst, der sich aus einer endlichen geraden Anzahl von Integralkurvenstiicken 
zusammensetzt und einen Punkt von f, mit einem Punkt von f,,; verbindet 
(vgl. Fig. 1.). In diesem Polygonzug werden offenbar die Kurvenstiicke der 
Scharen I und II dauernd abwechseln. Ferner wird man beim Ubergang von 
der Schar I zur Schar II eine ausspringende, beim Ubergang von der Schar II 
zur Schar I eine einspringende Ecke erhalten, sodass in dem konstruierten 
Polygonzug die ausspringenden und einspringenden Ecken immer alternieren. 


1° Vgl. 1. ce. Fussnote 1; 8. 59 Abschn. 13, 
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Da f, zu I und £4: zu II gehért, wird die Folge der Ecken mit einer aussprin- 
genden Ecke beginnen und einer ausspringenden Ecke aufhéren. Die Anzahl 
der ausspringenden Ecken des Polygonzuges, der sich von einem Punkte von 
c, bis zu einem Punkte von O41 erstreckt, ist somit genau um 1 grdésser als die 
Anzahl der einspringenden Ecken, d.h. aber, dass der Polygonzug in einem 
Bereiche 8, von 1. Art fiir den Index 7 den Beitrag + 1 liefert. 


16. 2. Fauu. c, und C41 sind beide von 2. Art; wir nennen dann auch den 
zugehérigen Bereich 8, von 2. Art. Wir unterscheiden ebenso wie beim 1. 
Fall die Integralkurven der Schar I und der Schar II und bemerken, dass jetzt 
das Integralkurvenstiick f,, das c, schneidet und dessen Tangente in der Um- 
gebung von Cy, gleich (dp/dé@), ist, zur Schar II gehért, da rechts von c, wegen 
(12) (dp/dé),; < 0 wird, wihrend das ¢, 41 mit der Tangente (dp/dé),; schneidet, 
zur Schar I gehért. da links von ¢,41 wegen (12) (dp/dé); > 0 wird. 


Fia. 2 


Konstruieren wir jetzt ebenso wie im 1. Fall einen Polygonzug, der sich aus 
endlich vielen Integralkurvenstiicken zusammensetzt und einen Punkt von f, 
mit emem Punkt von f,,; verbindet, so werden auch hier im Polygonzuge einer- 
seits die Kurvenstiicke der Schar I und der Schar II, anderseits die einspringen- 
den und ausspringenden Ecken dauernd wechseln (vgl. Fig. 2). Da f, zur 
Schar II, f,,. zur Schar I gehdrt, so wird die Folge der Ecken des Polygonzuges 
mit einer einspringenden Ecke beginnen und einer einspringenden Ecke aufhéren. 
Die Anzahl der einspringenden Ecken des Polygonzuges, der sich von einem 
Punkte von c, zu einem Punkte von c,4: erstreckt, ist somit genau um 1 grésser 
als die Anzahl der ausspringenden Ecken, d.h. aber, dass der Polygonzug in 
einem Bereiche %, von 2. Art fiir den Index 7 den Beitrag —1 liefert. 


17. 3. Fatu. Von den beiden Kurven c, und ¢,4; ist die eine von 1. Art, die 
andere von 2. Art. Dann sagen wir, der zugehérige Bereich B, sei von 3. Art. 
In diesem Fall miissen die beiden Kurvenstiicke f, und f,,; , die ¢, baw. ¢,.1 mit 
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der Tangente (dp/dé@); schneiden, beide zur gleichen Schar ghéren und zwar 
offenbar beide zur Schar I, wenn ¢, von 1. Art, ¢,4: von 2. Art,—beide zur Schar 
II, wenn c, von 2. Art, ¢,41 von 1. Art ist. 

Konstruiert man jetzt den Polygonzug aus Integralkurvenstiicken, der einen 
Punkt von f, mit einem Punkt von f,,; verbindet, so wird im 3. Falle, da ja 
auch hier wieder im Polygonzug einerseits die Kurvenstiicke der Schar I und der 
Schar II, anderseits aber auch die einspringenden und ausspringenden Ecken 
dauernd wechseln, offenbar die Anzahl der Kurvenstiicke des Polygonzuges 
ungerade und damit die Anzahl der Ecken gerade sein (vgl. Fig 3). D.h. 
aber der Polygonzug enthalt ebenso viel ausspringende wie einspringende 
Ecken. Somit liefert der Polygonzug in einem Bereich %, von 3. Art zum Index i 
den Beitrag 0. 


18. Nunmehr konstruiere man fiir jedes v(v = 1, 2, --- , n, m + 1) ein Integral- 
kurvenstiick f,, das c, mit der Tangente (dp/dé), schneidet, wobei man wegen 


Voraussetzung 3 §1. f,4: aus f, erhalt, indem man f, parallel zu sich in der 
Richtung der 6 Achse um 27 verschiebt, und konstruiere dann weiter in jedem 
der n Bereiche 8, (v = 1, 2, --- , n) einen Polygonzug in der oben angegebenen 
Weise, je nachdem %, ein Bereich 1., 2. oder 3. Art ist. Auf diese Weise 
gelangt man zu einem zusammenhiangenden Polygonzug » der p, 6 Ebene, der 
einen Punkt von f; mit einem Punkt von fn; verbindet. Geht man jetzt von 
der p, 6 Ebene wieder zu der u, v Ebene iiber, indem man die zu (7) inverse 
Transformation ausfihrt, so wird aus p ein geschlossener Polygonzug p* in der u, 
v Ebene, da nach der Transformation das Bild von f,4; mit dem Bild von f; 
zusammenfallt. Ferner gehen die Ecken von p in die Ecken von p* tiber und 
behalten offenbar nach der Transformation ihre Eigenschaft bei, ausspringend 
oder einspringend zu sein. 

19. Der Gesamtindex 7 des Nabels setzt sich mithin zusammen aus den ein- 
zelnen Beitraigen, den jeder der n Bereiche %, zu dem Index ¢ liefert. 

Beriicksichtigt man die Ergenbnisse der drei Abschnitte 15., 16., 17., so ergibt 
sich der Satz: 
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Der Index i des Nabels ist gleich der Differenz aus der Anzahl der Bereiche 
%, von 1. Art vermindert um die Anzahl der Bereiche 8, von 2. Art. 
Es bezeichne jetzt 


{1} die Anzahl der Bereiche %, von 1. Art, 
{2} die Anzahl der Bereiche 8, von 2. Art, 


+} die Anzahl der Kurven ¢, von 1. Art, 


(13) {—} die Anzahl der Kurven ¢, von 2. Art, 


dann ist 
t= {1} — {2}. 
Nun zeigt aber eine leichte Uberlegung, dass 
{1} — {2} = {+} - {-} 
ist. Somit erhalt man die wichtige Schlussformel 
(14) t= {+} —{-}. 
§4. Analytische Hilfsmittel zur Bestimmung des Index 


20. Zu den in §1 angegebenen fiinf Voraussetzungen iiber die Koeffizienten 
A, B der Differentialgleichung (8) fiigen wir jetzt noch zwei weitere hinzu. 

6. Fiir hinreichend kleine p (0 S p S 5) werde jede Kurve ¢,, auch die Kurven 3. 
Art (vgl. Voraussetzung 5 §1), durch die stetige Funktion 6 = 0,(p)(v = 1,2, ---, 
N,N + 1) analytisch dargestellt. 

Dann ist 


> 9,(p) fiir p > 0, 
7. Pir jedes v(v = 1, 2, --- , N) lasse sich A in das Produkt 
(15) A(p, 8) = (8 — 8) 


zerlegen, wobei k, eine ganze positive Zahl bedeutet, und A,(p, 9,(p) Ofiir0<o S 
6 ist. 

Dann ist offenbar: 
¢, von 1. Art, wenn k, ungerade und 


(16) se = +1, 
¢, von 2. Art, wenn k, ungerade und 

A,(p, Or()) _ 
(16’) sg "B(p, = —l, 


¢, von 3. Art, wenn k, gerade ist. 
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Bezeichnet k die grésste der N Zahlen k, und ist A(p, @) k mal partiell nach 6 
stetig differentiierbar, so wird 


6 = O,(p). 


Mit Riicksicht auf die Formeln (16), (16’) und (17) lasst sich der Index i 
statt durch den symbolischen Ausdruck (14) auch durch die Summe 


| a*” A(p, 0) 
(18) i= 9 6) 


6= 0,(p) 


v 


darstellen; hierbei ist die Summation nur iiber diejenigen der N Kurven @ = 
0,(p) zu erstrecken, welche von ungeradem Vielfachheitsgrad k, sind. 

Die Formel (18) wird den Ausgangspunkt unserer Untersuchungen im Teil 
II bilden, wo wir weitergehende analytishce Voraussetzungen als hier im Teil I 
machen werden. 


Kapiret II. ABLEITUNG DER DIFFERENTIALGLEICHUNG DER KRUMMUNGSLINIEN 


§5. Eine spezielle Darstellung der Flache 


t 21. Die vorgelegte Fliche mit stetiger Tangentenebene und stetiger Kriimm- 
ung denken wir uns durch einen Vektor ¢ bestimmt, dessen drei zu einander 
rechtwinklige Komponenten x, y, z als dreimal stetig differentiierbare Funk- 
i tionen zweier Veranderlichen a, 8 gegeben seien. Halt man den Anfangs- 
et punkt des Vektors r fest, so beschreibe der Endpunkt die vorgelegte Flache, die 
al wir kurz die Fliche x nennen wollen. Diese Flaiche enthalte den Nullpunkt und 
habe dort die x, y-Ebene zur Tangentenebene. e sei der zur positiven z-Achse 
parallele Einheitsvektor, u sei der Normaleinheinsvektor der Flache r; sein 
Vorzeichen sei so festgelegt, dass im Nullpunkt u = —e wird. 

Nunmebhr bilde man die Flache r vermittels der Formel 


te 


(e — u). 


(19) 


auf die x, y-Ebene ab.” 

Aus Formel (19) folgt nimlich unmittelbar eX = 0 d.h. aber dass der Vektor ¥ 
der x, y-Ebene parallel ist oder aber, wenn der Anfangspunkt des Vektors % 
im Nullpunkt festgehalten wird, dass sein Endpunkt ein Gebiet der x, y-Ebene 
beschreibt. wu, v seien die beiden rechtwinkligen Koordinaten dieses End- 
punktes oder, was dasselbe ist, die beiden rechtwinkligen Komponenten von %, 
sodass also durch (19) der Flichenpunkt mit den Koordinaten z, y, z in einem 
Punkt der x, y-Ebene mit den Koordinaten wu, v tibergefiihrt wird. 


11 Vgl. 1. ce. Fussnote 7; 8. 44 Formel (106). 
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Um eine einfache geometrische Deutung der Transformation (19) zu erhalten, 
setze man die Grdésse 


(20) —— = W. 


Dann ergibt sich aus (19) 

(21) X¥+ We=1+ Wu. 

Bezeichnet man den in (21) definierten Vektor mit 8, so folgt nach leichter 
Rechnung aus (21) 

(22) ed8 = uds = dW, 


und jetzt erkennt man unmittelbar aus (21) und (22), dass die Normale r + tu 
der Flache r durch Spiegelung an der Tangentialebene der Flache 8 in die auf 
der z,y-Ebene senkrechten Gerade ¥ + ¢e iibergefiihrt wird (vgl. Fig. 4.). W 


zt 
WwW. 


xy Ebene 


Fia. 4 


wird gleich der Entfernung desjenigen Punktes auf 8, in welchem der zu spie- 
gelnde Strahl einfallt, einerseits von der z,y-Ebene, andrerseits von der Flache r. 


22. Die bisherigen Uberlegungen lassen erkennen, wie sich, wenn die Flache r 
vorgelegt ist, W als Funktion von u and v aus den Beziehungen (19) und (20) 
ergibt. Jetzt sollen die Formeln (3) der Einleitung abgeleitet werden, welche 
zeigen, wie man die Flache r bestimmt, wenn W als dreimal stetig differentiier- 
bare Funktion von u und v gegeben ist. 

Zu diesem Zwecke bilde man die von dem Normalvektor u beschriebene 
Kinheitskugel durch stereographische Projektion auf die z,y-Ebene ab. Dies 
geschieht vermittels der Relation” 


(23) 
Hierbei ist U ein offenbar zur z,y-Ebene paralleler Vektor, dessen Endpunkt die 
stereographische Projektion des Vektors u beschreibt, wenn der Anfangspunkt 


 Vgl. l. c. Fussnote 7; S. 44 Formel (107). 
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des Vektors im Nullpunkt festgehalten wird. Umgekehrt wird u aus U durch 


die Formel”® 
u—e = 2——_, 
bestimmt. 

Jedem Punkt r der vorgelegten Flache sind somit jetzt zwei Punkte der z,y- 
Ebene zugeordnet: erstens der Endpunkt des Vektors ¥ mit den Koordinaten 
u, v durch die Relation (19), zweitens der Endpunkt des Vektors U, dessen recht- 
winklige Koordinaten wir mit U, V bezeichnen wollen, durch Vermittlung der 
stereographischen Projektion des der Flache r zugeordneten Normaleinheits- 
vektors u nach Formel (23). Diese zwei Abbildungen r — ¥ und r — U sind 
aber dann sicher beide umkehrbar eindeutig, wenn man sich auf eine hinreichend 
kleine Umgebung eines konvexen Flichenpunktes r beschrankt. Hierdurch 
ist dann aber weiter eine umkehrbar eindeutige Abbildung der z,y-Ebene in 
sich, ¥ — Ul, bestimmt, die man sich analytisch durch das Funktionenpaar 

U = U(u, »), V = V(u, v) 


gegeben denke. 
In der oben zitierten Untersuchung findet sich nun aber eine Formel, welche 


den Flachenvektor r mit Hilfe der beiden ebenen Vektoren ¥ und U ausdriickt™, 
und zwar ist 


wobei sich zeigen lasst, dass 
Ud¥ = Udu + Vadv 


ein vollstandiges Differential ist. 
Zieht man noch Formel (24) heran, so erhalt man 


dk) (u — 
und nunmehr lasst ein Vergleich mit Formel (19) und (20) erkennen. dass 
0 


ist, d.h. aber, es ist 
(26) U=W,; V=W.,. 


Setzt man die so erhaltenen Werte fiir i Ud¥ und fiir die Komponenten U, V 
0 


18 Vgl. 1. c. Fussnote 7; S. 45 Formel (108) und (109). 
14 Vgl. 1. c. Fussnote 7; 8S. 47 Formel (112). 
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von Win Formel (25) ein und schreibt Formel (25) fiir die drei Komponenten 
x, y, 2 von x einzeln auf, so ergeben sich die Formeln (3) der Einhleitung, welche 
zeigen, wie sich die Flache ¢ vermittels der Funktion W(u, v) darstellen lisst. 


§6. Ableitung der Differentialgleichung der Kriimmungslinien 

23. In einem gewoéhnlichen Punkt der Flache rx, in dem die beiden Haupt- 
krimmungsrichtungen bestimmt sind, seien fiir den Augenblick die beiden 
infinitesimalen Vektoren in Richtung der Kriimmungslinien mit d,r, dex be- 
zeichnet. Diese beiden Richtungen werden durch die Abbildung (19) in die 
beiden infinitesimalen ebenen Vektoren d,X, d2X, durch die Abbildung (23) in 
in dU, iibergefiihrt. 

Nun gibt es aber einen Satz”, nach dem 

ERsTENS: d;X%-deX = 0, dill-del = 0; 

ZwEITENS: die Richtung zu und die Richtung zu parallel ist; 
d.h. aber es existieren zwei Funktionen ,(u, v), \2(u, v), derart dass 


(27) = dal = 
ist. 

Aus dieser Beziehung lasst sich in Verbindung mit (26) die Differentialgleich- 
ung der Kriimmungslinien fast unmittelbar herleiten. Trennt man namlich 


die Gleichung (27) nach Komponenten, so erhalt man etwa fiir die Richtung 
d,u: dy die beiden linearen Gleichungen 


= + Wadw = 
= + = didyw, 


und die Elimination von \; ergibt—indem wir jetzt einfach wieder du, dv statt 
du, dw schreiben—die in Formel (4) der Einleitung angegebene Form fiir die 
Differentialgleichung der Kriimmungslinien: 


War(du? dv’) 4 (Ww Wuu)dudv = 0. 
24. Nunmehr hat man noch, um die in Kap. I beschriebene Methode anwen- 


den zu kénnen, wie in Abschnitt 9. §1 Polarkoordinaten einzufiihren. Es werde 
wieder gesetzt 


(28) u=pcos0, v=psiné. 
Dann wird, wie eine leichte Rechnung zeigt, 


(29) 
W,= W, sin 9 + ™* cos 0 


* Vgl. 1. c. Fussnote 7; S. 44 die mit I und II bezeichneten Satze. 
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W,sin 20 | We 2) 


Ww\ cos 20 Wo . 
Die Transformation (28) fiihrt, wie in Abschnitt 9. §1 gezeigt wurde, die Differen- 
tialgleichung (4) in die Form (8) 

A(dp’ — p'dé’) + 2Bpdpdé = 0 
iiber, wobei man die Ausdriicke fiir die Koeffizienten A und B erhalt, wenn man 
in die Formeln (9) fiir X baw. Y die Ausdriicke (30) fiir Wu» baw. (Wor — Wan) 
einsetzt. Diese Rechnung ergibt schliesslich 


(31) 


Will man endlich auch in die Formeln (3) der Einleitung Polarkoordinaten 
einfiihren, so erhalt man wegen (29) die fiir manche Zwecke wichtige Darstellung 
der Koordinaten des Flachenpunktes durch die Funktion W(p, @); und zwar wird 


x = — 2W 
1+ Wi 


(32) W, sin @+ cos 6 


= 

L 2 


Kapiret III. Diz BestimmunG pres NABELINDEX 
§7. Der Rotationsnabel 


25. Die vorgelegte Fliche ¢ habe jetzt im Nullpunkt einen Nabel. Sollte 
dieser in einem speziellen Fall in einen Flachpunkt ausgeartet sein, so verwandle 
man diesen Flachpunkt vermittels einer geeigneten Transformation durch 
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reziproke Radien in einen gewohnlichen Nabel. Da bei dieser Transformation 
bekanntlich Kriimmungslinien wieder in Kriimmungslinien tibergehen, bleibt 
bei ihr offenbar auch der Index des Nabels invariant. Ist aber der Nullpunkt 
ein echter Nabel (kein Flachpunkt), so ist die Flache r in der Umgebung dieses 
Punktes gewiss konvex; die beiden in §5 beschriebenen Abbildungen r — X 
und g > Usind mithin umkehrbar eindeutig. 

Es sei nun in der Umgebung des Nullpunktes, d.h. etwa im Streifen 0 S p < 6 


| 
(33) W(p, 0) = + p™(wo(6) + F(p, 


Hierbei seien R und m zwei feste positive Zahlen, m > 2, wo(@) und F(p, @) 
dreimal im ganzen Streifen stetig differentiierbare Funktionen, die in @ die 
Periode 2x haben. Ausserdem mége w(@) nicht identisch verschwinden, ander- 
seits sei 


(34) lim F(p, 0) — 0. 


Setzt man jetzt die Funktion 
2 


Wo(p, @) = 


in die Gleichungen (32) von §6 fiir W ein, so bemerkt man leicht, dass die so er- 
haltene Flache der Gleichung 


(35) =R 


geniigt und somit eine Kugel vom Radius R ist, die die ,y-Ebene im Nullpunkt 
beriihrt. 

Andrerseits folgt jetzt aus (33), dass diese Kugel die Flaiche r im Nullpunkt 
von der Ordnung m — 1 > 1 berihrt; mithin ist der Nullpunkt ein Nabel der 
Flaiche und die Kugel (35) die zu diesem Nabel gehérige Schmiegungskugel.”® 


16 Um die Bedeutung der Voraussetzung, dass sich die Funktion W in der Umgebung 
des Nabels auf die Form (33) bringen lasst, besser zu verstehen, bemerken wir, dass diese 
Voraussetzung sicher erfillt ist, wenn 

1. die Ordnung m — 1 = 2 der Berithrung zwischen Flaiche und Schmiegungskugel im 
Nabel eine ganze Zahl ist, 

2. die Funktion W als Funktion von u und v betrachtet im Nullpunkt m + 1 mal nach wu 
und v stetig differentiierbar ist. 

Dann wird namlich 


cos" 6 sin™—* 


wie) = 


Ou" Oy™-# 


Offenbar ist aber die Voraussetzung des Textes allgemeiner, da in der Darstellung 
(33) fiir W erstens m eine beliebige positive Zahl > 2, zweitens wo(@) eine beliebige drei- 
mal stetig differentiierbare periodische Funktion sein darf. 
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26. Setzt man den Ausdruck (33) fiir die Funktion W in die Formeln (31) 0; 
fir die Koeffizienten A und B der Differentialgleichung (8) der Kriimmungslinien er 
ein, so ergibt sich 
(36) A = p™*((m — 1)(wo + Fe) + pF); 

(37) 2B = p™*(wo — — 2m)(wo + F) + Foo — (2m — 1)pF, — p Fy»), 
unter wo, wo die Differentialquotienten von wo(8) verstanden. Ferner folgt 
aus (34) 
(38) lim F, — 0, lim Fg — 0, in 
p70 
m 
(39) lim 0). 

27. I. Fauy. Es sei wo(@) gleich einer Konstanten wy , mithin wo =0. Dann ty 
lisst sich auf Grund von Formel (36), (37), (38) und (39) die Differential- 
gleichung (8) durch p”” dividieren, und man erhalt nach dieser Division und 
dem Grenziibergang p — 0 sel 
(40) mF (0, — (m* — 2m)wedpdd = 0. 
Hieraus folgt aber, dass die Differentialgleichung (8) jetzt auch in jedem Punkte a 
der Geraden p = 0 regular ist, und zwar ergibt sich aus (40): hi 

dp/2 (m— 2)wo 

i 

d.h. aber durch jeden Punkt der Geraden p = 0 gehen zwei Integralkurven ke 
hindurch, deren eine die Gerade p = 0 selbst ist. | 

Das bedeutet endlich fiir die Umgebung des Nabels der vorgelegten Flache r, 
dass, ebenso wie von dem Pol einer Rotationsfliche, von dem Nabel in jeder (4 
Richtung @ eine und nur eine Kriimmungslinie ausgeht. Diese strahlenférmig 
von 0 ausgehenden Kurven bilden die eine Schar der Kriimmungslinien; die Ax 
zweite Schar Krimmungslinien auf x sind entweder geschlossene Kurven oder va 
Kurven, die sich um den Nullpunkt als asymptotischen Punkt spiralférmig ele 
winden. In beiden Fiillen ist der Index des Nabels i = 0." Damit ist im Falle 
wo = konstant die Behauptung (2) bewiesen. 

Man sieht leicht ein, dass die Bedingung wy = 0 gleichzeitig notwendig und ge 
hinreichend dafiir ist, dass die Differentialgleichung (8) in jedem Punkt der die 
Geraden p = 0 regular ist oder, was dasselbe ist, dass vom Nabel in jeder Rich- Gl 
tung eine und nur eine Kriimmungslinie ausgeht. Wir nennen einen solchen om 
Nabel einen Rotationsnabel. Er hat die weitere fiir ihn charakteristische | 

Ma 


Higenschaft, dass nur zu einem Rotationsnabel eine Rotationsflache existiert, 
welche die vorgelegte Flaiche r im Nabel von héherer Ordnung, (nimlich der 


17 Vgl. 1. c. Fussnote 1; S. 53 Abschnitt 5. 
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Ordnung m) berithrt als die Schmiegungskugel (35). Diese Rotationsfliche 
erhalt man, wenn man die Funktion 


2 
m 
Wi 4R + wop 
in die Formeln (32) fir W einsetzat.”* 


§8. Beweis der Indexabschiatzung unter den einfachsten Voraussetzungen 


98. Il. Fatt. Wenn wo(@) nicht gleich einer Konstanten ist, so muss die 
Ableitung wo(@) im Intervall 0 < 6 < 2x mindestens zweimal verschwinden, da 
in ihm wo(@) als periodische Funktion wenigstens ein Maximum und ein Mini- 
mum haben muss. 

Wir machen jetzt zunadchst die weiteren Voraussetzungen: 

(a) wo(0) habe im Intervall0 < 0 < 2m nur endlich viele Nullstellen 6 = 0, (v = 
1,2,---,nj;n = 2), die sémtlich einfach sind, 

(8) der Ausdruck 

wo (6) — — 2m)wo(6) 


sei fiir allen Werte 6 = 0, von Null verschieden. 

Jetzt sind, wie man nach Division der Differentialgleichung (8) durch p 
(vgl. die Formeln (36) und (87) fiir A und B) unmittelbar erkennt, die Stellen 
p = 0, 6 = 8, die einzigen Singularitaten der Differentialgleichung (8), wihrend 
die Gerade p = 0 als Doppellésung von (8) durch alle reguliren Punkte p = 0 
hindurehgeht, bei denen @ von den n Werten @, verschieden ist. Hieraus schliesst 
man nach bekannten Satzen der Theorie der Differentialgleichungen weiter, 
dass vom Nabel, dem Nullpunkt der Flaiche rx, Kriimmungslinien nur in den n 
Richtungen @ = 6, ausgehen kénnen. Da wir im folgenden von dieser Tatsache 
keinen Gebrauch zu machen haben, begniigen wir uns hier mit dieser Andeutung. 

29. Aus (36) ergibt sich wegen (38) in Zusammenhang mit Voraussetzung (a) 
(41) lim 2 — (m — 1) wo(6,) ¥ 0 


Aus (41) folgt anderseits, dass sich zu jedem » fiir ein hinreichend kleines Inter- 
vall 0 < p S 6 eine Funktion 6 = 0,() eindeutig bestimmen lisst, welche 
gleichzeitig den beiden Bedingungen 


lim 0,(p) — 6,, A(p, 0,(p)) =0 
p—0 
geniigt. Mithin existieren im Intervall 0 < @ < 2” genau n Kurven ¢, , welche 


die in $1, Voraussetzung 5. beschriebenen Eigenschaften haben und durch die 
Gleichung @ = @,(p) analytisch dargestellt werden. 


'S Vgl. die Arbeit des Verf.: ‘Zur Theorie der sphirischen Abbildung im Grossen I.” 
Math, Zeitschr. 31 (1930), S. 629-708. Siehe insbesondere Fussnote 7; 8S. 638 und 639. 
Die zitierte Arbeit behandelt das Problem, eine geschlossene konvexe Fliche mit nur 
zwei Rotationsnabelpunkten zu konstruieren, wenn das spharische Bild der Kriimmungs- 
linien gegeben ist. 
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Damit sind aber ausserdem auch die in §4 angegebenen Voraussetzungen 6. 
und 7. erfullt, wobei die in Formel (15) auftretenden Exponenten k, (eben wegen 
der Voraussetzung (a)) simtlich gleich 1 sind. Es kann daher auch keine der 
Kurven ¢, dritter Art sein, da dann k, gerade sein miisste, und zwar ist nach 
Formel (16), (16’) und (17) 


c, erster Art, wenn sg aC ee = +1, 


c, zweiter Art, wenn sg 
» Ur 


Um diese Vorzeichen zu berechnen, beriicksichtige man, dass wegen (37), 
(39) und (41) 
Ao(p ,(p)) : Ao(p 0,(p)) 
2B(00,(0)) 


(m — 1)wo (6,) 1 


we (8 


(42) 


wobei der Nenner wegen Voraussetzung (8) nie gleich Null ist. 


30. Nunmehr behaupten wir: Setzt man in den Ausdruck rechter Hand von 
(42) fiir 6, nach einander die Werte 


ein, so kann dieser Ausdruck nicht zweimal hintereinander den Wert —1 anneh- 
men, und hieraus folgt dann unmittelbar, wenn wir die Bezeichnung (13) des 
§3 benutzen, 


{+} 2 {-}, d.h. aber z = 0 wegen (14) 


Bewets. Damit 


= —1, 


wo (6,) 
ist notwendig (aber nicht hinreichend), dass wo(0,)/wo (82) > 0. sg wo(@) = 
sg wo(6,) bedeutet aber in Verbindung mit wo(8,) = 0, dass die periodische 
Funktion wo(@) an der Stelle 6 = 6, entweder ein negatives Maximum oder ein 
positives Minimum hat. Wenn jetzt gleichzeitig 


sg ( — — 2m) 


wo(6,) 
0, 
Wo (6,) Wo (8,41) 


ware, so wirde das heissen, dass etwa auf ein negatives Maximum als nachstes 
Extremum ein positives Minimum folgen wiirde (bzw. umgekehrt ein negatives 
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Maximum auf ein positives Minimum). Da dies unméglich ist, so ist damit die 
Behauptung zu Beginn dieses Abschnittes bewiesen. 


31. Die im vorigen Abschnitt bewiesene Behauptung lisst sich auch so 
formulieren, dass zwei auf einander folgende Kurven ¢,, ¢,4: nicht beide zweiter 
Art sein kénnen, d.h. aber nach den Definitionen der Abschnitte 15., 16., 17. §3, 
dass unter den Voraussetzungen (a) und (8) alle durch die Kurven ¢,, ¢,4; begrenz- 
ten Bereiche 8, von 1. Art oder 3. Art, aber nie von 2. Art sind. Und hieraus 
folgt nun weiter leicht, dass sich jetzt immer ein aus Integralkurvenstiicken 
zusammengesetzter Polygonzug p konstruieren lasst, welcher nur ausspringende 
Ecken, aber keine einspringenden Ecken mehr enthalt. Man braucht zu diesem 


Fie. 5b 


Zwecke nur das in Abschnitt 18. §3 angegebene Konstruktionsverfahren so 
abzuandern, dass sich der Polygonzug p einzig aus den dort mit f, bezeichneten 
Bogen zusammensetzt, was sich unschwer erreichen lasst (vgl. Figur 5a). 

Ausgenommen ist hierbei allerdings der Fall, dass sich die eine Schar der 
Kriimmungslinien um den Nabel als asymptotischen Punkt spiralférmig windet. 
Dann wird sich ein Polygonzug » konstruieren lassen, der genau eine einsprin- 
gende und eine ausspringende Ecke enthalt. (vgl. Figur 5b). 


§9. Verallgemeinerung der Voraussetzung des §8 


32. Wir behalten die Voraussetzung (8) des §8 zunichst noch bei, ersetzen 
aber die Voraussetzung (a) durch die weniger fordernden, bereits in der Ein- 
leitung Abschnitt 7 angegebenen beiden Voraussetzungen 
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I. wo(6) habe nur isolierte Nullstellen, 
II. wo(@) habe nur Nullstellen erster oder zweiter, aber keine Nullstellen dritter 
oder héherer Ordnung; d.h. also, es soll nicht gleichzeitig fir ein 6 = 6, 


= 0, wold) =0 w(6) = 0 
sein. 
33. Es sei 0 = 6x cine Nullstelle héherer Ordnung von w¢(@) dh. es sei 
wo(Ox) = wo(Ox) = 0. 


Dann folgt aus Voraussetzung II: wo(8x) ¥ 0, und es ist wegen (37) 


(48) — 2m)wo(Ox) 0. 


Ausserdem lasst sich ein hinreichend kleines Rechteck 6x — ex S 6 S 6x + ex, 
0 < p S 6x, kurz mit Rx bezeichnet, bestimmen derart, dass in ihm wegen (43) 


(44) sg B = —sg wo(0x) 


A 
(45) <1 


und endlich auf den beiden vertikalen Rechteckseiten wegen (36) und (38) 
(46) sgA = sg wo(Ox — €x); sg A lomox-tex = sg wo(Ox + ex) 
ist. 

Wir sagen jetzt, 0 = 0x sei: 

eine Nullstelle erster Art, von wo , wenn 


wo(Ox — x) wo(Ox + x) _ 
(47) 1; +1, 


eine Nullstelle zweiter Art von wo , wenn 


Wo(Ox) 


wo(@x — ex) _ wolOx + ex) _ 
(48) 8Z Wo (6x) +1 ? Wo (Ox) ‘ 


eine Nullstelle dritter Art von wo , wenn 


wo(Ox — €x) wo(Ox + ex) 
(49) 


Das bedeutet aber: die Nullstelle 6 = 0x von wo ist: 

von erster Art, wenn Wo an der Stelle 0 = Ox ein positives Maximum oder ein 
negatives Minimum hat, 

von zwetter Art, wenn wo an der Stelle 6 = Ox ein negatives Maximum oder ein 
positives Minimum hat, 

von dritter Art, wenn wy an der Stelle 0 = 6x einen Wendepunkt hat. 
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34. Aus (45) folgt jetzt, dass die Reihe (10) des §2 fiir (dp/d@); im Rechteck 
Rx gleichmissig konvergent ist. Ausserdem kann die Bezichung (10) als eine 
iiberall in Rx regulare Differentialgleichung erster Ordnung aufgefasst werden. 
Mithin lasst sich ein Integralbogen fx konstruieren, der der Differentialgleichung 
(10) und damit auch (8) geniigt und ausserdem zwei Punkte Px, Px der beiden 
vertikalen Seiten des Rechteckes 6 = @x — ex, 0 = 0x + ex mit einander ver- 
bindet, ohne einen Punkt des Geradenstiickes p = 0 zu enthalten, das selbst 
eine Integralkurve von (10) ist. 

Da wegen (12) §2, (44) und (46) fiir 0 < p S dx 


“ (%) = — ex) 
6=0K—€K B K—€K wo(Ox) 
A wo(Ox + ex) 

88 1 86 B wo(Ox) 


ist, so hat der Integralbogen fx im Anfangspunkt Px und Endpunkt Px Tangen- 
ten, deren Vorzeichen durch (47), (48) oder (49) angegeben werden, je nachdem 
die Nullstelle 6 = @x von 1. Art, 2. Art oder 3. Art ist. Wenn wir nun die eben 
konstruierten Bogen fx mit den in §3 Abschnitt 18 beschriebenen Bogen f, 
vergleichen, so gelangen wir leicht zu folgendem Ergebnis: 

Beziiglich des Vorzeichens der Anfangs- und Endtangenten in den Punkten 
Px und Px verhalten sich die Bogen fx genau so, als ob sie 

eine Kurve cx von 1. Art schneiden, wenn die Nullstelle @x von 1. Art ist, 

eine Kurve cx von 2. Art schneiden, wenn die Nullstelle 6x von 2. Art ist, 

eine Kurve cx von 3. Art schneiden, wenn die Nullstelle 6x von 3. Art ist. 

Man konstruiere daher jetzt den Polygonzug p genau ebenso wie in Abschnitt 
18. §3, wobei man sich nur die Rechtecke Rx durch eine Kurve cx von gleicher 
Art wie die zugehérige Nullstelle 6x ersetzt denkt. Bei dieser Konstruktion 
sind wieder die Kurven cx 3. Art ohne Bedeutung, sodass man sie sich ebenso wie 
in §3 forgelassen denken kann. 

Nachdem man so die Hilfskurven cx , soweit sie 1. oder 2. Art sind, zu den den 
einfachen Nullstellen 6 = 6, von wo entsprechenden Kurven ¢, hinzugefiigt hat, 
zeigt man ebenso wie in Abschnitt 30. §8, dass zwei Kurven c¢, 2. Art nicht 
unmittelbar aufeinander folgen kénnen; und damit ist die Behauptung 7 2 0 
unter den Voraussetzungen (8) (Abschnitt 28. §8), I und II (Abschnitt 31. §9) 
bewiesen. 


35. Wir lassen jetzt die Voraussetzung (8) fallen, nehmen somit an, es giibe 
eine Nullstelle @ = 6, von wo , bei der 


(50) wo — (m? — 2m)wo(G,) = 0. 
Aus Voraussetzung II (Abschnitt 31.) folgt dann 
wo(6,) #0 0 
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und weiter wegen (50) 


(51) > 0. 


Wegen (41) existiert nach dem Satz iiber implizite Funktionen eine eindeutig 
bestimmte Funktion 6 = 0,(p), die gleichzeitigden beiden Bedingungen 


A(p, 9,(p)) = 0, lim 0, — 6, 
p=0 


geniigt. Die durch diese Funktion dargestellte Kurve ¢, ist nun 1. Art oder 2. 
Art, je nachdem 


og At Ag | 
2B 


ist. In beiden Fallen folgt aber aus (51), dass die Stelle @ = 0, entweder ein 
positives Minimum oder ein negatives Maximum der Funktion wo(@) ist, d.h. 
aber, dass weder die unmittelbar vorausgehende Kurve ¢,-; noch die unmittelbar 
folgende Kurve ¢,4: von 2. Art sein kann. Damit ist die ee 720 
unter den Voraussetzungen I und II vollstindig bewiesen. 


Ag | 
= +1 oder sg — = -—1 
+1 oder sg DB 


36. Aus der soeben bewiesenen Tatsache: unter den Voraussetzungen I und 
II kénnen nie zwei Kurven 2. Art c, und ¢,4; unmittelbar auf einander folgen, 
lisst sich ebenso wie in Abschnitt 31. §8 leicht schliessen, dass sich um den Nabel 
immer ein aus Stiicken von Kriimmungslinien zusammengesetzter geschlossener 
Polygonzug »* konstruieren lisst, welcher tiberhaupt keine einspringenden Ecken 
enthdlt; ausgenommen ist hierbet wieder nur der Fall, dass sich die eine Schar 
der Kriimmungslinien um den Nabel als asymptotischen Punkt spiralférmig windet. 

In dem spiater zu veréffentlichenden II. Teil wird die einschrankende 
Voraussetzung ITI fallen gelassen; es sind dann somit Funktionen w»(6) zuge- 
lassen, die Nullstellen beliebig hoher Ordnung haben. Dann wird man aber 
zeigen k6énnen, dass es Fille gibt, in denen zwei unmittelbar aufeinander fol- 
gende Kurven c, zweiter Art auftreten, die beide vom gleichen Punkte @ = 0, der 
Geraden p = 0 ausgehen. In diesem Falle lasst sich um den Nabel nicht ohne 
weiteres ein Polygonzug p* konstruieren der nur ausspringende Ecken enthilt. 
Die Tatsache, dass dieser Ausnahmefall auftreten kann, ist wohl der Hauptgrund 
dafiir, dass die Untersuchungen des II. Teils wesentlich schwieriger und 
langwieriger sind als im I. Teil. 


BERLIN, GERMANY. 
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A system L of elements a, b, ¢, x, y, --- is said to be partially ordered if a rela- 
tion a S b (equivalently, b = a) is defined for certain pairs of elements in such 


a way that 
I: a b,b S and only a = b. 
I,: a S b,b S c together imply a S c. 


A subset S of Lis said to have a sum (intersection) in L, written >> (S)([] (S)), 
if for every cin L, ec = >> (S) (ec S [J (S)) if and only if ¢ = a(e S a) for 


every ain S. 
A partially ordered L is called a lattice if 


Il: Every finite subset a,, d2,---,@n has a sum )-3-14; (written also 
a + a2 + + ap) and an intersection []}-1 a; (written also --+ an).” 


A lattice is said to be countably continuous if 


IIIj: Every countable subset a; , a2, --- has a sum >>3 a; and for every cin L, 


a; = (c ai). 


Every countable subset a2, --- has an intersection a; and for 


every cin L, + = (ce + 
A lattice is said to be continuous if 


IIl,: For every ordinal Q and for every set of elements az, a < Q, there is a 
sum Ge, and for every cin L, ¢ Daca da = >: (C > Ga). 

III,: For every ordinal Q and for every set of elements az, a < Q, there is an 
intersection [Jaco Ge , and for every cin + [Jaca da = [J yea (¢ + 


A lattice is called modular if 
IV: a S c implies that (a + b)c = a + be for all b in L. 
A lattice is called complemented if 


V: Whenever a S b S ¢ there exists an element d (not necessarily unique) such 


thatb+d=candbd =a. (Suchad will be written [c — b], .) 


Continuous complemented modular lattices were defined and studied by one 
of the writers and it was shown that any such lattice which satisfies a certain 
irreducibility condition is either a projective geometry or a geometry of a new, 


‘ Sterling Research Fellow, Yale University. 
* The sum (intersection) is clearly unique if it exists at all. 


* This would be implied if every pair of elements a, b had a sum a + 6b and an inter- 


section ab, 


‘ By the well-ordering theorem every set of elements a; , { ranging over an arbitrary set 


of subscripts, can be arranged as a set da, a < &, for some ordinal ©. 
87 
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non-finite dimensional type which was named continuous geometry.” Of great 
| importance in this work are the notion of perspectivity (a is perspective to b, 
written a ~ b, if for some z, called an axis, a + x = b + x and ax = bz), and 
1 the transitivity of the relation of perspectivity, that is, for any finite n, 

1 


(*) ae, ~ M3, +++ ~ Any together imply a ~ 


The general validity of (*) was established by proving first the special case 


n+1 
(**) dg ~ 5 An ™ = ai, 


together imply a, ~ 


In the present note we give simple proofs of theorems concerning perspective 
| mappings in modular lattices, from which (**) follows. While the previous 
\ proof of (**) is valid in every countably continuous complemented modular 
| lattice’ the present proof shows that (**) holds in every complemented modular 
| lattice which satisfies III, (but not necessarily IIIs). 
In what follows we assume only that L is a modular lattice. 
Derinition 1.° L(a, b) is the set of all c in L for whicha < ¢ S b. 
Lemma 1. Ifa~ b with axis x and c = ax = ba, then a (1, 1) correspondence 
between the elements of L(c, a) and those of L(c, b) which preserves the relation < 
(hence also the operations of sum and intersection) is defined by the inverse mappings 


(T) Ta = (a,+2)b for cS 
| (S) b, > Sh, = + for <b. 
f Proor. Since (a; + 2)b 2 bx = c and (a, + x)b S b it follows that Ta 


is an element of L(c, b). If a, is an element of L(c, a) then, from the modularity 
of L, we have 


= {(a + 2)b + 


(a +2)(b+2)a = (a+ z)a 

that is, STa,; = a,. Similarly, 7'Sb; = b, if b; is an element of L(c, b). Thus 7, 
S are inverse mappings which define a (1, 1) correspondence as stated. It is 
clear from the definition of 7, S that the relation < is preserved. 


DerFiniTion 2. The mapping T of Lemma 1 is called a “perspective mapping.” 
The axis x is said to determine the mapping. 


5 See J. v. Neumann: Proceedings of the National Academy of Sciences, vol. 32 (1936), pp. 
92-100, and other references given there. The axioms formulated there are slightly differ- 
ent from but equivalent to those used here. 

° See J. v. Neumann: Continuous Geometry, planographed, Institute for Advanced Study, 
Princeton, N. J., 1936-38, particularly Part I, Theorem 4.2. 

7 Even the general (*) holds in any countably continuous complemented modular 
lattice. See I. Halperin: Transactions of the American Mathematical Society, vol. 44 (1938), 
pp. 537-62. 
ie 4 8 Definitions 1-5, Lemmas 1-5, 8, 10 and Theorem 1 are taken from Part I of the work 
referred to in footnote (6). The ‘independence over c’’ defined in Definition 3, would be 
“independence” as defined in loc. cit. if ¢ were a “zero” or “least’’ element in L. 
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Lemma 2. If the axis x determines a perspective mapping T of L(c, a) on 
L(c, b) then x(a + b) determines the same mapping. 

Proor. z(a + b) is an axis of perspectivity between a and 6b since: 
a+2(a+b) = (a+2z)(a+ = a+ similarly, b + c(a+b) =a+b= 
a + a(a + b); and az(a + b) = ax = c; similarly, br(a + b) = c. Now 
z(a + 6) determines the mapping 7’ since for all c S a, S a we have 
‘a, + x(a + b)}b = (a + x)(a + b)b = (a + = Tay. 

Lemma 3. If T is a perspective mapping of L(c, a) on L(c, b) ande S a, Sa 
then the T-mapping of L(c, a) on L(c, Ta;) is a perspective mapping; if c S 
a, < ab then this T-mapping is the identity mapping of L(c, a;) on itself. 

Proor. Let x be the axis determining 7. Then 

ax = ajax = ¢; (Ta,)x2 = (a, + x)bx = ¢; 
= = (a+2)b4+2 = (Ta) +2; 


hence a; ~ Ta with axis x. Now the T-mapping of L(c, a) is the perspec- 
tive mapping determined by the axis x since for all c S az S a, we have 
(a2 + x)Ta, = (a2 + + = (a2 + 2)b = Ta. Ife S S abitis 
clear that for all ¢c S az S a, we have Taz = (a2 + x)b + a2 + ba = a. 

DeFIniITION 3. The elements a, d, b are said to be ‘‘independent’’ (over c) if 
a(d + b) = d(a+ b) = b(a + d)(= ¢). 

It is clear that if a, d, b are independent, they are independent over a 
unique c. 

Lemma 4. a, d, b are independent (over c) if and only if ab = d(a + b)(= ce). 

Proor. The necessity of the conditions is clear since from Definition 3, 
d(a + b) = cand ab = ab(a + d)(b +d) = cc = c. On the other hand, if the 
conditions are satisfied, then a(d + b) = a(a + b)(d + b) = afb + d(a+b)} = 
a(b + ab) = ab = ¢; similarly, b(a + d) = c. Hence a, d, b are independent 
over ¢. 

Lemma 5. If a, d, b are independent then (a + d)(b + d) = d. 

Proor. (a+ d)(b+d) =d+a(b+d) =d+d(a+b) =d. 

TuEeorEeM 1. Let 7 be a perspective mapping of L(c, a) on L(c, d), let Tz be a 
perspective mapping of L(c, d) on L(c, b), and let T be the product mapping T27'; . 
Tf a, d, b are independent then T is a perspective mapping. 

Proor. By Lemma 2 there are axes 2 , %2 determining 7’, , 72 respectively, 
such that S$ (a + d) and S (6 +d). Definex = 2, +22. Thena~b 
with axis z since: 

ax = a(a + + = + (a + d)a(b + d)} 

= a(x, + da) = at = ¢; 
similarly, = 


b+ 2. Now T is the perspective mapping of L(c, a) on L(c, b) determined by 
the axis zx since for all c < a; S a we have 


(a + x)b = (a; + a+ x2)(b + d)b = { (a + (a + d)(b + d) + 


= + + = = Ta. 
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Lemma 6. Let T be a perspective mapping of L(c, a) on L(c, b) and let 7, 
be a perspective mapping of L(c, d) on L(c, a). If d(a + 6) S ab then there 
exists a perspective mapping 1's of L(c, d) on L(ec, b) such that T = T:(T,)™. 

Proor. Let x be an axis determining 7 such that « S (a + b); let x be an 
axis determining 7; such that x; < (a + d) (x, 2 exist by Lemma 2). Define 
te = x + a. Then 2 determines a perspective mapping 72 of L(c, d) on 
L(c, b) since: 


and 
dx. = d(a + d)(x2 + 1) = + (a + b)a(a + d)} 
= dja + z(a + d(a + b))} = d(a + ax) = dx = ¢; 
similarly, 
baz = b(b + a)(x + m1) = bfx + (a + b)ai(a + d)} 
= b( + am) = br = €. 


Now 7 = 172(7;)™ since for all ¢ S a S a we have 


+ + = = ((a + + d) + 
= (a +a+2z)(a+ bb 
= {(a +z) + (a+ + b)}b = (a, + + am)b 
= (a, + = Ta. 


THEOREM 2. Let 7; be a perspective mapping of L(c, a:) on L(c, ai+1) for 
a= 1,---,n. If there exists an element d with d(a; + Gis) S Gidiys Jor 
i = 1,---,n and d(a, + Gny1) = G10n4,, and a perspective mapping T of 
d) on a), then the product mapping T = T,Tn-1--- ts a@ per- 
spective mapping of L(c, a1) on L(c, dni). 

Proor. By successive applications of Lemma 6 we can find perspective 
mappings 7; of L(c, d) on L(c, a;) such that T; = Tis(T3)". Then 


T = ToT; 


Since d(a; + @n41) = Lemma 4 shows that a: , d, are independent. 
Theorem 1 then shows that 7’ is a perspective mapping as stated. 

THEOREM 3. Let T; be a perspective mapping of L(c, a) on L(c, d), let T: be a 
perspective mapping of L(c, d) on L(c, b), and let T be the product mapping T27 . 
If ad = db = cand a + Tig = Ta + Tim for allc < a S athenT isa 
perspective mapping. 

Proor. Since T7,;a = d and Ta = b we have a + d = d + b and ad = 
db = c; hence d determines a perspective mapping of L(c, a) on L(c, b). This 
mapping is identical with 7 since for all c S a S a we have (a, + d)b = 


(ay + Tia, + d)b = (Ta, + Ta; + d)b = Ta, + db = Ta, . 
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DEFINITION 4. @ < b means thata S b but a # b. 

LemMa 7. Let T; be a perspective mapping of L(c, a) on L(e, d), let Tz be a 
perspective mapping of L(c, d) on L(c, b) and let T' be the product mapping T21’, . 
If c < aand L is complemented then there exists an a, with c < a, S a such that 
that the T-mapping of L(c, a) on L(c, T'a,) is a perspective mapping. 

Proor. Ife < ad we can take a; = ad; for Lemma 3 shows that the 7'-map- 
ping of L(c, ad) is the identity mapping and that the 7:-mapping, which then 
coincides with the 7T-mapping, is a perspective mapping. Since c S ad we 
need therefore consider only the case c = ad. Similarly, we may suppose that 
c = db. If now ag + Tia2 = Ta, + Ta for alle S a, S a then Theorem 3 
shows that 7’ itself is a perspective mapping. If for some c S a S a we 
have d2 + Ta, + Tia, then either (i) a2(Ta: + Tya2) < ae or (ii) 
(T'az)(a2 + T1a2) < Taz. If (i) holds, define a; = [az — a2(Ta, + . 
Thence < a S az, hence = = c and a,(Ta, + Tya,) = 
a(Ta, + + Tia2) = c. Then Lemma 4 shows that a,;, 7'a 
are independent. Now Lemma 3 and Theorem 1 show that the 7'-map- 
ping of L(c, a) is a perspective mapping. If (ii) holds, define a, = 
T'[(Ta2) — (Tas)(a2 + Tia2)|.. Then < a S ae, = c, and 
+ = + Tya1)(a2 + Tia2) = c. Hence a, 
Ta; are independent and the 7-mapping of L(c, a,) on L(c, T'a;) is a perspective 
mapping. This proves the lemma. 

Lemma 8. If T; is a perspective mapping of L(c, a;) on L(c, ai41) for i = 
1,---,n and T is the product mapping T,T,-; --- T2T;, then T is a (1, 1) 
mapping of L(c, a,) on L(c, Gny1) which preserves the relation S. 

Proor. This is immediate from Definition 2. 

DerinitTion 5. The mapping T of Lemma 8 is called a “projective mapping.” 

Lemma 9. Let T be a projective mapp.ng of L(c, a) on L(c, b). Ife <a 
and L is complemented then there exists an element a; with c < a, S a such that 
the T-mapping of L(c, a:) on L(c, Ta) is a perspective mapping. 

Proor. This follows from successive applications of Lemma 7. 

Derinition 6. A (1, 1) mapping T of L(c, a) on L(c, b) which preserves the 
relation S, is called “‘piece-wise perspective’ if for some ordinal Q a can be ex- 
pressed as a sum a = Ua , where for everya <Q 

(i)a S Ua S cand, ifa>,c < ue. 

(ii). Uau" = c for some u* with a = u* = us for all B < a. 

(ili). The T-mapping of L(c, ua) on L(c, Tua) is a perspective mapping. 

THEOREM 4. Every projective mapping in a complemented modular lattice is 
piece-wise perspective. 

Proor. Let 7 be a projective mapping of L(c, a) on L(c, b). Now define 
u% = u' = c and define uz, u* for a > 1 by transfinite induction as follows: 
Suppose that for some ordinal y we have already defined u, , u“ to satisfy (i)a , 
(ii). , (iii). of Definition 6 for all a < y. If there exists an element d such that 
d = ue for all a < y and d = a does not hold, then define u” = ad, and y = 
la — u7|.. Thene < y. From Lemmas 3, 9 there exists an element u, such 
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that ¢ <u, S y and the 7-mapping of L(c, uy) is a perspective mapping. It 
is clear that = u,ju’y = c and hence (i), , (ii), , (ili), are satisfied. But 
from (i)e, (ii)«, it follows that the u. are mutually different and hence cannot 
have greater cardinal power than the set of all elements in L. Thus it must 
happen for some 2 that no d = wa for all a < Q exists such that d = a does 
not hold, that is, a = )cacaUa- This proves the theorem. 

Lemma 10. If a ~ b and c is an element < ab and L is complemented, then 
there exists a perspective mapping of L(c, a) on L(c, b). 

Proor. Let a ~ b with axis x and define y = [(x + ab) — ab].. Then 
y < (x + ab), hence ay = a(x + ab)y = (ax + ab)y = aby = c; similarly, 
by 2g; similarly, b + y = 
b+2=a+z2. The lemma now follows from Lemma 1. 

TuHreorEM 5. Let a, ~ de, ~ G3, Gn ~ hold in a complemented 
modular lattice. If there exists an element d such that d ~ a, d(a; + diy) S$ 
for alli = and d(a, + Gn41) = then Qa ™ Ant. 

Proor. Let c = a@n,:. Then clearlyc S dande Sa. Ife S a; for 


any 7 it follows that ai: 2 aidiz, 2 d(ai + Gizi1) 2 cc = c, thatisec S ain. 
Thus by induction on 7 we have c S a;foralli = 1,---,2+1. From Lemma 
10 it follows that there exists a perspective mapping of L(c, a;) on L(c, aj;:) 
for all i = 1,---,n and of L(c, d) on L(c, a;). The theorem now follows 
from Theorem 2.° 

THEOREM 6. Let a; ~ Gz ~ 3, +++, Qn ~ hold in a complemented 
modular lattice which satisfies If = [] a; then a, ~ days. 

Proor. Define ¢ = ad@n,,:. Then by Lemma 10 there exists a perspective 
mapping of L(c, a;) on L(e, a;41) for allt = 1, --- , n, hence a projective mapping 
T of L(c, a:) on L(c, ani). It follows from Theorem 4 that we can express a 
as = Ua Where ¢ S Ua S a1, sca = Ug) = 6, 
and the 7-mapping of L(c, wa) is a perspective mapping for all a < Q. By 
Lemma 2 we may assume that the 7-mapping of L(c, ua) is determined by 
an 2% such that ta S (Wa + Tua). Now define x = ) pa Yq. Since 


(Tuy) Tua) = T(u, Ua) = Tc = ¢, 


and Ua) (Tuy + Tue) = Ua) (Tuy + = 
it follows by Lemma 4 that (SVacyla)(Soacy Ta), Tu, are independent over 
c. Similarly, Ua), Uy, (Doacy Ta) are independent over c. Then 


+ ay} { + Uy} 
+ + + uy)} { + Uy} 


* This shows that (**) holds in any modular lattice which can be suitably enlarged to 
provide an element d with d ~ a; and d(2*t}a;) = aydn+1 . 


© In such a lattice there exists a zero element and an extensive theory of independence 
(see loc. cit. footnote (6)). The use of this theory would simplify the proof of Theorem 6. 
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= {(Qeacrta) + + (Uy + + Te) } 
= { + 2,[u, + (Tuy) Ma Ta) |} (Quasy te) 

= te) 

= He): 


Then, using III,, we have 


Since au = ¢ it follows by transfinite induction on 8 that (Doacst«)(Dvacp Ua) 
= c for all 8. Hence, setting 8 = Q, we have za, = c. Similarly, ran4; = c. 
Finally, 


a t+2= (ocata) + = (MacaTue) + = Gay: + 2. 


Thus a; ~ @n41 With axis x and the theorem is established. 

Let a; ~ de, ~ G3, +--+, Gn ~ hold in a complemented 
modular lattice which satisfies If ayany, = 0," then a, ~ 

Proor. This is clearly a particular case of Theorem 6. 


Tue InsTITUTE FOR ADVANCED Stupy, AND 
HARVARD UNIVERSITY 


11) denotes the zero element of L (see footnote (10)). 
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ON RINGS OF OPERATORS. III 
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(Received June 14, 1939) 


INTRODUCTION 


ahaa: 1. In two earlier papers ((1), (2)), F. J. Murray and the author investigated 
i certain operator rings, called factors. (Cf. (1), p. 188, Definition 3.1.2. For an 
ne introduction into the theory of operator rings ef. (5) particularly pp. 372-376, 
te 388-398.) The motives which led to those investigations are described in (1), 
pp. 116-123. 

The main principle of classification for factors, which was found in (1), is 
based on the ranges of their relative dimension functions. (Cf. ibid., p. 165, 
Definition 8.2.1; p. 168, Theorem VII; and p. 172, Theorem VIII.) Thus all 
| factors were found to belong to classes called (In) (n = 1, 2, ---), (I,), (Ih), 
| | (II,,), (II1,,). It was shown that factors in each one of these classes actually do 
exist—with the exception of (III,,), which had to be left undecided. (Cf. (1), 
p. 208, Theorem XII.) 

The main result of this paper is that factors of class (III,,) also exist. (Cf. 
Theorem IX, and Examples (a), (8) in §4.4.) Thus the above classification of 
factors is fully justified. 


2. This result is obtained by the simultaneous use of two devices: The no- 
tions of norm and normedness, which form the subject of Chapter I; and the 
| process A'*'!*?!"" which is the analogue of the process of taking the diagonal 
| part of a finite matrix, and is discussed in Chapter II. 
t Both notions have an interest of their own, and we think that they deserve 
| to be studied for their own sake. In this paper their analysis is restricted to the 
amount which is necessary for our immediate purposes, although we permitted 
ourselves a certain leeway in some cases where a little more generality than 
absolutely necessary seemed to make things clearer and easier. But we propose 
to discuss both of them independently and more fully at some other occasion. 
Chapter III gives some explicit constructions of factors, generalizing those of 
(1), pp. 192-204. And then, with the help of the above-mentioned tools of 
Chapters I, II, it is shown in Chapter IV, to which classes the factors of Chapter 
III belong. At this stage examples of factors of class (III,,) are also obtained. 


3. The main questions concerning factors were stated in (1) as Problems 1-11. 
Of these, Problems 1, 2, 5, and parts of 3, 4, 6, 7, 8, 9, 10 were answered in (1); 
Problem 11 was answered in (2). Our present result will settle the remainder of 
Problems 3, 4. Important parts of 6, 7, 8, 9, 10 remain unsettled. Especially 
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the following question is still open, and it is in our opinion of a quite decisive 
character: Are all factors of class (II,) isomorphic to each other? (Part of 6.) 
We think that the answer to this question will greatly affect the applicability 
of the theory to factors, especially to quantum mechanics. 

F. J. Murray and the author believe that the answer is negative, and that 
further invari.nts—probably of an algebraical and group theoretical nature— 
exist. 

Certain partial results have been obtained, and they will be discussed else- 
where. But the main question (cf. above) is still unanswered. 


4. The work (7) of the author also led to factors. The parts of the ring @*' 
in the various incomplete direct products TTee~as.-:- (Den, ® DHn,2)) ((7), p. 71 ’ 
Lemma 7.3.1) were found to be in certain cases factors of classes (I,), (II:), 
and (II,,). (Cf. ibid., pp. 71-77, in particular Lemmas 7.4.1, 7.5.1.) It is not 
difficult to show that the above-mentioned parts are always (that is, in every 

Application of our present results, namely of our Theorem IX, shows that 
these factors are of class (III,,) in certain cases. 

More precisely: It was shown in (7), p. 71, Lemma 7.3.1, that the part of C*? 
in @ depended essentially (that is, up to isomorphisms) 
only upon a certain sequence a, a2,--- of constants 20, $1. It was also 


shown ibid., pp. 71-77, that its class was (I,,) for a1 = ag = --- = 1, (Ij) for 
a, = a = --- = 0, and stated that it was (II,) for a. = a3 = --- = 0, a = 
a; = ++» = 1. Now we can show this: The class is (III,,), if for some 6 > 0 


we have for infinitely many n = 1, 2,---,6 Sa, 21 — 6. 

The factors obtained by the above process have various instructive aspects, 
and will be discussed (together with the above-stated results) elsewhere. 

The surmises stated at the end of (7) have to be modified in accordance with 
the results stated above. 


5. For an account of modern operator theory in general, the reader is referred 
to (3) or (4). For the other topics touched, a general orientation may be 
obtained from (5), (1). A familiarity with the methods and results of these 
papers will be assumed. 

Our notations are the same as in the papers enumerated above, for a detailed 
description ef. (1), pp. 126-127. 

A detailed table of contents and a bibliography follow. 
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CuapTer I. THe RevatTiveE TRACE IN AN ARBITRARY FACTOR 


1.1. Let S be a Hilbert space, and 9il a factor in §. The notation of the 
relative trace Toy (A) was defined for all Hermitean A ¢ 9M in (1), pp. 212-213, 
Definition 15.1.1, and for all A ¢ 9 without restriction in (2), p. 218, Definition 
2.2.1, provided that DN belongs to one of the finite classes: (I,) (n = 1, 2, --- ), 
(II,). We shall now extend this definition to all factors 9X but (when 9M is 
infinite) not all A «9. We shall see, in particular, that when MM is purely 
infinite, ie. of class (III,), our definition covers A = 0 only. Hence the 
factors ON for which we really achieve something, are those in the not purely 
infinite classes: (I,,), (II,,). (For (I,,) in particular, we get the usual notion of 
the trace and the E. Schmidt class of operators in DN cf. the discussion (B) in 
§1.6 below. Nevertheless, we shall use this extended notion of the relative 
trace in this paper for a factor 9M of unknown class, with the purpose of showing 
that OM is just of class (III,).) 


1.2. Let SN be an arbitrary factor, D(M) its relative dimension function, in 
any normalization. ((1), p. 165, Definition 8.2.1, and p. 168, Theorem VII.) 
For any A eM the closed, linear set [Range A] can be formed, and is » DM. 
(This notion is defined in (1), p. 141, Definition 4.2.1. If a proof of the state- 
ment is wanted, cf. (1), pp. 151-152, Lemmas 6.1.1, 6.2.1.) 

DEFINITION 1.2.1. For every A ¢M we define a “rank’’ as follows: 


Rank (A) = D({Range A)). 


(The reader will easily verify that if i = & = the ring of all bounded operators 
in §—which is of class (I,,)—and if D(M) is chosen in the standard normaliza- 
tion—(1), p. 173, Lemma 8.6.1—then this is the matrix-rank in its usual sense.) 
Together with D(9M), Rank (A) can assume finite or infinite numerical values. 
When the former is the case, we say that A has finite rank. 
Lemma 1.2.1. Rank (A) has the following properties: 
(i) Rank (A) = 0, it is = 0 if and only if A = 0. 
(ii) Rank (A) = Rank (A*). 
(iii) Rank (aA) = Rank (A) for every complex number a # 0. 
(iv) Rank (A + B) S Rank (A) + Rank (B). 
(v) Rank (AB) S Rank (A), Rank (B). 
(vi) Rank (E) = D(E) for every projection E « MM. 
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PROOF: 
Ad (i): Rank (A) = D({Range A]) is clearly 2 0, and = 0 if and only if 
[Range A] = (0), ie. Range A = (0), which means A = 0. 

Ad (ii): Coincides with (1), p. 152, Lemma 6.2.1. 

Ad (iii): Since A is linear, already Range (aA) = Range (A) if a ¥ 0. 

Ad (iv): Clearly Range (A + B) C [Range A, Range B], hence [Range 
(A + B)| C [[Range A], [Range B]], whence (1), p. 210, Lemma 14.2.3, 
gives the desired result. 

Ad (v): Clearly Range (AB) C Range A, hence [Range (AB)] C [Range 4], 
and so Rank (AB) < Rank (A). Combining this with (ii), we obtain: 
Rank (AB) = Rank ((AB)*) = Rank (B*A*) S Rank (B*) = 
Rank (B). 

This completes the proof. 

Ad (vi): If E is the projection of the closed linear set IN, then Range E = M, 
hence [Range E] = M. 

Lemma 1.2.2. The finiteness of rank has the following properties: 

(i) 0 has finite rank. 

(ii) A has a finite rank if and only if A* has a finite rank. 

(iii) A has a finite rank if and only if aA has a finite rank, a being any fixed 

complex number 0. 

(iv) A + B has a finite rank, if the same is true for both A and B. 

(v) AB has a finite rank, if the same is true for either A or B. 

(vi) E has a finite rank if and only if it is finite in the sense of (1), p. 155, Defini- 

tion 7.1.1, E being any fixed projection ¢ M. 
Proor: These (i)-(vi) are immediate consequences of the corresponding 

(i)—(vi) in Lemma 1.2.1.— 

Throughout what follows we will make use of the results of these Lemmas 

1.2.1, 1.2.2, without referring to them explicitly. 

DeriniTIon 1.2.2. A closed, linear set M—or its projection E M ((1), 

p. 141, Lemma 4.2.1)—‘“‘contains” an A ¢€ 9M if 


[Range A], [Range A*] C M. 
Lemma 1.2.3. E contains A if and only if 
EA =AE= 4A. 


Proor: [Range A] C M is equivalent to Range A C M, that is, to Af eM 
for every f « , that is to HAf = Af. In other words: To EA = A. 

Similarly [Range A*] C I means HA* = A*, that is (applying * to both 
sides) AE = A. 

Hence both together amount to HA = AE = A. 

Lemma 1.2.4. A (e 9M) is of finite rank, if and only if it is contained in a 
finite M (n 

Proor: Sufficiency: [Range A] C M, M is finite, hence [Range A] is finite, 
that is, Rank (A) = D([Range A]) is a finite number. 
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Necessity: A has finite rank, hence A* has also finite rank, so [Range Aj, 
|Range A*] are finite, and with them 92 = [[Range A],-|Range A*]] ((1), p. 160, 
Lemma 7.3.5). This 2% meets all requirements._— 

Given a finite 9i)—and its projection E (e9i)—then those A 
which are contained in Yt are by Lemma 1.2.3 precisely the elements which 
the correspondence X = Xm) (discussed in (1), pp. 186-187, Definition 11.3.1, 
Lemmas 11.3.2-11.3.4) maps on the factor Mic) in M (instead of H!). So we 
see: If A runs over all elements of 9M which are contained in IM, then A (gq) runs 
over all elements of Miia) . 

The factor 9m) necessarily belongs to one of the finite classes: (I,) (n = 
1, 2,--- ), (Ili) (2), p. 188, Lemma 11.3.7, or p. 189, Lemma 11.4.3). Hence 
there exists a unique relative trace in Sym) defined for all A gmeMy : 
THM yy (Ac). (For the definition of ef. the beginning of §1.1, 
above.) We now define: 

Derinition 1.2.3. If A eM and of finite rank, then define for every finite, 
closed, linear set MOM which contains A (such M exist by Lemma 1.2.4) 


(A) D(M)T May) (A cy). 


(This TH ay (A) depends also on the normalization of D(M), but we do not 
need to indicate this explicitly.) 

Lema 1.2.5. The TOiuq(A) of Definition 1.2.3 depends on A only, and not 
on M (as long as A, Mt are related as described there). 

Proor: In other words: If MM, MN are both finite, closed linear sets, both » ON 
and both containing A, then we claim that 


(1) TH (A) = (A). 


Consider first the case where I C NR. Consider the operators Aw € Mim . 
Form the operator Ag x in R, which is reduced by 2 and which coincides with 
Ay» in M and with 0 in NR — Mt. We claim that Agjx eM. Indeed: 
Ay implies Am = Ag for an As = Pye M, we have 
(EAE), = Aw, EAE € 9M, so we may replace A by EAE, which coincides 
with 0 in § — MM. That is: We may assume that A itself coincides with 0 
in § — M. Hence = Aw € Mua 

Observe further: 

(i)* Em) is the unit in M, hence for Man) . And Tim 2 Ew 
(ii)* (a-Am)\x = a-Agin 
(ii)* (Am + Ba)in = Amn + - 
(iv)* If Am is definite, then Amy is also definite. 
(v)* (Am) ix = Anis 
(vi)* = . 
Now put for every Ag € Man) 


(2) = 


DO) 
DM) 


TI (A 
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Comparing (2) and (i)*-(vi)* with (i)-(vi) in “Property IIT” in (2), pp. 
218-219, we see immediately that ¢(As) fulfills (ii)-(vi), including (iv)’, men- 
tioned there (for Mim , Pin place of M, H). (vi)’ is also fulfilled, since it is a 
consequence of (vi). (Put U-"A, U for A, B.) Let us finally check (i). Con- 
sidering (2) and (i)*, it becomes 


1 = TM») (Eon), that is 


Now we know from “Property III,” (*), in (2), pp. 218-219, together with 
(1), p. 219 (end of the first paragraph), that we have for every idempotent 


F = Px $C N, TN xy) (Fo) D'(F ay) D'(§), 


where D’($) is the relative dimension function of 9M) in its standard nor- 
malization—that is, with D’(9t) = 1. By (1), p. 188, Lemma 11.3.7, D(¥) 
is also a relative dimension function of 9i.») , but in order to make its normaliza- 
tion the standard one, we must divide it by D(Jt). So D’($) = D(P)/DM), 
and consequently = Now putting $B = M, F = 
gives (3). Thus (i) is also established. 

We have thus shown that ¢(A) fulfills all conditions (i)-(vi)’ in “Property 
III” in (2), pp. 218-219. Hence by “Property IV” in (2), p. 219, we have 
TM) (Am) = Or, if we substitute this in (2): 


(4) D(M)T (Am) = (Amz), 


Consider now our original A. Since it is contained in Jt, so HA = AE = A 
by Lemma 1.2.3, therefore it coincides with 0 in © — Mt. Therefore we have 
for Am = Am also Amjx = Aq. By this substitution, however, (4) become 


D(M)TM = (Ac), 


which, by Definition 1.2.3, is precisely (1). 

This completes the proof for the case where It C MN. 

Consider next the case where Jt, Jt are arbitrary. Then form PB = [Me, Nl. 
Along with M, N this $ will also be 7 ON, and it will also be finite ((1), p. 160, 
Lemma 7.3.5). We have C and contains A since M, do. So 
we have (1) for Mt, $ and for MN, B (in place of Mt, M), and hence for Mt, MN too. 

Thus the proof is completed. 

1.2.4. Considering Lemma 1.2.5 we can drop the It fromthe 
TH) (A) of Definition 1.2.3. We shall therefore write for every A «MM of finite 


rank (in the sense of Definition 1.2.3): 
To (A) = Tip) (A). 
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We obtain now with ease: 

TueoreM I. Toq(A) has the properties (ii)-(vi) in “Property III” in (2), 
pp. 218-219). That is: 

(ii) Toy(aA) = aToy(A), a a complex number. 

(iii) Ton(A + B) = Toy (A) + Ton(B). 

(iv) Toy(A) = 0 of A ts definite. 

(iv)’ Tay(A) > 0 if A is definite and ¥ 0. 

(v) Toy(A*) = 

(vi) Toy(AB) = Toy (BA) if either A or B has a finite rank, and both are « M. 
(vi)’ Tay(U" AU) = Toy (A) if U (e MN) is unitary. 

We have also: 

(vii) Toy (E) = D(B) if E (eM) is a finite projection. 

(Throughout (i)—(vii) remember Lemma 1.2.2.) 

Proor: Ad (ii), (iv), (iv)’, (v): They follow immediately from the corre- 
sponding statements in “Property III” in (2), pp. 218-219. We must only 
observe for (ii), (v), that 9% contains aA, A* along with A. And for (iv)’, 
that if M contains A, then Aim) = 0 implies A = 0. 

. Ad (iii) and (vi), when A, B have both finite rank: In order to deduce them 

from the corresponding statements of “Property III,’ as above, we only need a 
finite My Oi which contains both A, B. Let M, Rn OM be finite and contain 
A, B respectively. Then $ = [M, I] meets all requirements. (For the finite- 
ness by (1), p. 160, Lemma 7.3.5.) 

Ad (vi): Since (vi) is symmetric in A, B, we may assume that A has a finite 
rank. So has BA, choose a finite Jt 79 which contains both A, BA (ef. 
above). So we have for EH = P» by Lemma 1.2.3, AE = A, EBA = BA. 
Consequently 


AB = AE-B = A-EB, 
BA = EBA = EB.A. 

Now A has a finite rank, and EB has also a finite rank along with E. So our 
previous result gives Toy (A-EB) = Toy(EB-A), and consequently Toy (AB) 
= Toy (BA). 

Ad (vi)’: Put UA, U for A, Bin (vi). U~*A has a finite rank along with A. 

Ad (vii): Put EF = Pg, M is finite along with E, it is 7 MM, and obviously 
contains E. Ey, = I is the unit in M, hence for Mim). So we obtain 
from Definition 1.2.3, considering (i) in “Property III,” as above: 


= DM)TIH = DM). 


1.3. Our new relative trace Toy (A) was only defined for the A «9 with a 
finite rank. We now wish to extend its definition to a wider domain of operators 
A €Ol, with the help of a suitable limiting process. 
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We first introduce a norm in the space of all A ¢ OM of finite rank, to be 
denoted by [[A]], which is analogous to the notion of (2), p..241, Definition 4.3.1, 
and Lemma 4.3.2 (especially the latter). 

Derinition 1.3.1. If A € 9M has a finite rank, then we define the “norm” of A, 


a number [[A]] = 0, by 
= (Ton(A*A))’ = (Ton (AA*))*. 


(Remember Theorem I, (iv), (vi). A*A, AA* are obviously definite.) 

TueoreM II. [[A]] has the characteristic properties of a norm in a linear 
space. That is: 
(i) Always [[A]] 2 0, and = 0 ¢f and only if A = 0. 
(ii) [[aA]] = | a|-[[A]] for every complex a. 
(iii) + B]] [[A]] + 
[[A]] also possesses the following further properties: 
(iv) [[A*]] = [[A]l. 
(v) {(AB]] (2 ||| A |||-[[B]], where only B need be of finite rank, 

< ||| B|||-[[A]], where only A need be of finite rank. 

(vi) | Toq(AB) | S [[A]]-[[B]], where A, B must both be of finite rank. 
(vii) [[Z]] = (D(B))', where E is a finite projection. 

Proor: We prove these statements in a somewhat changed order: 
Ad (i) : Immediately by Theorem I, (iv), (iv)’ if we recall that A*A = 0 implies 
A=0. 
Ad (ii): Immediately by Theorem I, (ii). 
Ad (iv): Immediately by the form of Definition 1.3.1. 
Ad (vii): Immediately by Theorem I, (vii), since E*E = E’ = E. 
Ad (v), first inequality: Put ||| A ||| = a, then a’J — A*A is definite,’ hence 
B*(a@I — A*A)B is definite.’ Now 


B*(a’I — A*A)B = a’B*B — B*A*AB = a’B*B — (AB)*(AB), 


and so this expression is definite too. Therefore Theorem I, (iv), gives, together 
with (ii), (iii), eod.: 


0 < Toy(a’B*B — (AB)*(AB)) 
= Toy (B*B) — Toy ((AB)*(AB)) = — [[ABIl, 
[[AB]] a[[B]] = ||| A ||| 
Ad (v), second inequality: Combine the above result with (iv): 
[[AB]] = [[(AB)*]] = [[B*A*] ||| B* = B 


1 || Af ||? = (Af, Af) = (A*Af, f) = 0, || Af || = 0, hence A = 0. 
((a*l — = f) — (A*AJ,f) = f) (Af, Af) = a2 || f |] Af 
3 B*CB is always definite along with C: 

(B*CBf, f) = (CBf, Bf) = 0. 
4 ||| B* ||| = ||| Bll, ef. (5), p. 384, footnote 36. 
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Ad (vi): Considering (iv), we might as well prove 
(5) | Toy (A*B) | S 


In what follows we shall make use continuously of Theorem I, (ii), (iii), (v), 
and of (ii) above: 


0 < [[A — = Toy((A — B)*(A — B)) 
= Toy (A*A + B*B — A*B — B*A) 
= Toy(A*A) + Toy (B*B) — Toy (A*B) — Ton (B*A) 
= + — To(A*B) — Toy(A*B) 
= [[A]P + [IB]? — 
Replace A, B by aA, ‘B, where a > 0; then this becomes 


The greatest lower bound of the right side for all a > 0 is [[A]]-[[B]], hence 
RT Q(A*B) 
Now replace B by e'“B, & real, then this becomes 
R(e"*Toy(A*B)) < 


The maximum of the left side for all real a is | Tay(A*B) |, hence (5) results. 
Ad (ili): Use Theorem I, (iii), and (iv), (vi) above: 


+ = Toy((A + B)*(A + B)) = Toy (A*A + B*B + A*B 4+ B*A) 
Toy(A*A) + Toy(B*B) + Toy(A*B) + Toy(B*A) 


< + + =- + 
[[A + B]] [{A}] + 


Throughout what follows we shall make use of the results of these Theorems I 
and IT without referring to them explicitly. Theorem II, (i)-(iii), give us also 
the right to use [[A]] as a norm and [[A — B]] as a distance, without any further 
comment. 

In ||| A ||| and [[A]] we have two different numerical evaluations for the 
size of A. It is therefore desirable to evaluate each one in terms of the other. 
This is only feasible to a very limited extent, and is expressed by the Lemmas 
1.3.1, 1.3.2, which follow. (Cf., however, B.3 in (B) in §1.6, when 9 is in 
class (I,).) 


| | 
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Lema 1.3.1. [[A]] S$ (Rank (A))’- || A ||]. 

Proor: Put M@ = [Range A], EH = Pm. Then A eM implies M Mi, 
E €M (ef. Definition 1.2.2), and D(E) = D(M) = D({Range A]) = Rank (4A), 
[[E]] = (D(B))' = (Rank (A))'. Every Af belongs to M so HAf = Af, that is 
EA = A. Hence 


([A]] = 4 ||| = (Rank 4 


Lema 1.3.2. If [[A]] S de with 6, « > 0, then there exists an EF eM with 
D(E) 80 that ||| AZ — £) ||| 
Proor: A*A is Hermitean and bounded, so it has a spectral form 


6) (A*Aj, 9) = [rd BOY, 


(This is an application of the well-known spectral theorem for bounded oper- 
ators, cf. e.g. (5), pp. 389-390, footnote 42, and p. 418, or (1), p. 212, Definition 
15.1.1. The Stieltjes integral converges, cf. eod.) E(A), —-7 <A < +>, 
is a resolution of unity (cf. eod.). Put f = g in (6), since (A*Af, f) = 
(Af, Af) = || Af |, (BOOS, = |), we then obtain 


For \> < 0 put f = E(Ao)g. Then the right side in (7) becomes 


The left side of (7) is = 0, hence Ay < 0 gives || E(A0)g ||? < 0, so ||H(Ao)g || = 0, 
E(\o)g = 0. That is: 
(8) 4 <0 implies E(A) = 0. 


Now (7), (8) give for f = E(é*)g: 


that is 
(9) ||| AZ@) ||| 6. 
On the other hand we have by (7), (8) 
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= [0 - 0. 


So A*A — &(I — E(6’)) is definite, hence 
= Toy(A*A) = — = — E()). 
But [[A]? < consequently 
(10) DI — &. 
So (9), (10) state that E = I — E(é) meets all our requirements. 


1.4. DeriniTION 1.4.1. A sequence of operators A;, As, --~ is called “regular” 
if it possesses the following properties: 
(i) 
(ii) All A, have finite ranks. (But their ranks need not be bounded.) 
(iii) The numbers ||| A: |||, ||| Ae |||, --- are bounded. 
Derinition 1.4.2. A regular sequence A; , Ac, --- is “fundamental” if 


lim [[An — A,]] = 0. 


DeFINITION 1.4.3. Two regular sequences A,, Az, --- and B,, Bz, are 
“equi I af 


lim [[An B,]] = 0. 
no 
In all our considerations about operator-convergence which will follow, we 
shall use strong convergence. This is the definition (cf. (5), pp. 381-382): 


DEFINITION 1.4.4. A sequence Ao,--- is “strongly convergent” to the 
limit A, in symbols 


lim str A, = A, 
if these conditions are fulfilled 


(i) The numbers ||| Ax |||, ||| As |||, --- are bounded. 
(ii) For every f e S 


|| Aaf — Af || = 0. 


t 
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We shall use only the most obvious properties of strong convergence. (Cf. (5), 
pp. 382-384. In Definition 1.4.4 above, (i) is a consequence of (ii)—ef. (5), 
p. 382, footnote 35—but we do not need this fact.) 

We proceed now to establish connections between fundamental sequences 
and equivalence on one hand, and strong convergence on the other. 


Lemma 1.4.1. Let Ai, Ao, --- be a regular sequence with 
lim [[A,]] = 0. 


Then it possesses a subsequence A;,, Ai,, +++ (in > ©) with 
lim str A;, = 0. 


Proor: For every m = 1, 2, --- there exists an m = o(m), such that n = m 
implies [[A,]] S$ 1/4”. Choose i; < % < --+ witht, 2 mo(m). Then we have 


1 
(sll S$ 


Apply Lemma 1.3.2 with 6 = « = 1/2”, then an E£,, e Mi obtains, with 


(11) D(En) < 


Put En = Pm,,, Nm = [Mm, Mmii,-::], Fm = Px,,. Then we have also 
F, €M, and (1), p. 168, Lemma 8.3.2 gives 


< DM), 


1 1 
D(F») < 
1 

13 < 
(13) Da) 
Also, by construction of the Im, Mm D Nir, Pm 2 Fmii, that is 
(14) 


Hence by (4), p. 77, Theorem 19, we have lim strnio Fm = F = Py,.2. ...- 
and by (1), p. 169, Lemma 8.3.4, 


---) = lim D(®%,),  _D(F) = lim D(F,). 


This is = 0 by (13), so F = 0, that is 
(15) lim str F,, = 0. 


(16 

By 

hol 
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of 
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(12) states || — Em)f || S 1/2” ||f ||, hence for fe — Mn 
1 


By construction of Im Mm C Mm, hence H — Mn D H — Mn, and so (16) 
holds for all fe  — Nn. Assume finally fe H — N,, m = p. Then (14) 
gives Ny D Nn so H — Ny © H — Nn, and so (16) holds again. 

So if fe & — N,, then (16) holds for all m = p. That is: Then 


As (17) holds in every § — %, and considering (15), we see that the domain 
of validity of (17) is an everywhere dense set in §.° 

But the domain of validity of (17) is a closed set, because the A;, As, --- 
are uniformly continuous by (iii) in Definition 1.4.1. Hence it comprises all ©. 
Thus (ii) in Definition 1.4.4 is established for the sequence A;,, Ai, , --- 
(i) eod. follows from (iii) in Definition 1.4.1. 

Lemma 1.4.2. Let Ai, Ae, --- be afundamental sequence. Thenlimstrn.. An 
exists. 

Proor: Since (i) in Definition 1.4.4 follows from (iii) in Definition 1.4.1, 
we must only prove the validity .of (ii) in Definition 1.4.4 for a suitable A. 
That is: For every f « S the existence of a g e S (which then is to be Af) with 
limnso || Anf —g|| = 0. And since is complete, this is certainly the case if 
(18 lim || — || = 0. 


Therefore assume the opposite: That (18) is not true for a suitable f « §. 
So we have an fo e and a 6 > 0, such that 


(19) || — Aj 26 for n=1,2,---, where 


From the fact that A,;, Az, --- is a fundamental sequence, we conclude 
immediately that A;, — Aj,, Ai, — Aj,, +++ is a regular sequence with 
lim» +x [[Ai, — A;,]] = 0. Therefore we may apply Lemma 1.4.1 to Ai, — Aj, , 
Ai, — Aj,,--+ (in place of A;, Az,---). This sequence possesses a subse- 
quence A;,, — Ax, , Aig, A ing) (kn ©), with lim (Ai, — A ix,) 
= 0. Thus in particular lim,., || (Ai, — Ai,,)fo|| = 0, that is 


(20) lim || As,,fo — Aj,,fo || = 0, 
(19) and (20) contradict each other, and this completes the proof. 


Lemma 1.4.3. Let Ai, Ac,--- be a fundamental sequence with lim sttn. 
A, = 0. Then it possesses a subsequence Ai,, Ai,,--+ (in > ©) with 


lim [[A;,]] = 0. 


_ Consider an aribtrary f ©. Then fp = (I — F,)f — Np, and limy.,,, || f — fr || = 
limp. || || = 0 by (15). 


lim || Af || = 0. | 
| 
™ 
| 
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Proor: Since A,, Az,--- is a fundamental sequence, we have lim,, ,_., 
[[4m — An]l] = 0. Hence there exists for every p = 1, 2,--- an m = m(p) 
such that m, n = m imply [[A» — A,]] S 1/4’. Choose 1 < tz < --+ with 
ip = no(p). Then we have 


| 1 


Apply Lemma 1.3.2 with 6 = « = 1/2”, then an EZ, ¢ Dil obtains, with 


RE 1 
(21) 


Let us now proceed as in the proof of Lemma 1.4.1. Put Ep, = Pm,, N, = 
Mou, ---], Fp = Px,. Then we have again F,, «9, and as there 


D(®,) < x D(Mz), 


1 


By the construction of the Mir, so Ni D Me D --- , and 
H-MCH-Mc---. 


(24) 


Consider now anf e $ — N,. Then we have for every p = qalsofe H — N, 
by (24), hence (I — E,)f = f. So (22) gives for such an f 


— Aga ll SAIS 


If r > s 2 q, then the above inequality may be applied to p = s,s + 1, ---, 
r- 


Adding gives 
s SA sis 


| 
| ‘i 
d 
(25) 
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Now lim An = O implies limy.. || A,f || = 0, and a fortiori lim,.., 
|Ai,f || =0. Soifs2q is fixed, and we let r ~ with r > s then (25) gives: 


(26 Il S IIS if's 


This is true for every fe H — MN, that is for every f = (I — F,)g. Substi- 
tuting this into (26) gives: 


1 
Fog S ll @ - Fog ll ll. 


That is: 
(27 Il 4, — Fe) S 
Having obtained (27), we can now proceed to a general evaluation of the A, . " 


Since the numbers ||| A; |||, ||| Ae |||, --- are bounded, by (iii) in Definition 
1.4.1, there exists a fixed number a with 


(28) Aa ||| for n = 1, 2, --- 
Hence (23), (28) give: 


Consider now any finite ae GeM. Then (27), (29) yield: 
([GA;,]] S [[G@A;, + — F,)]] 
S ||| @ - (4s, Fel] + — F,) - 


S1- t 
(D(@)' 
+ 
(D(@) 


Put = [Range A;], G, = Again OM, G,e And always 
Aif By, = Af, that is G,A;, = Ai. So (I — = 0, and 
consequently 

(I — G,)(Ai, — Au) = — G)Ai,. 


| 
| 

| 

| 


= 
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Hence, by using our original construction of thet, %, --- , 
— = — G,)(Ai, — 


1 1 


(31) (U- GAM 


Now combine (30) with G = G, and (31), remembering that D(G,) = D($,) = 
Rank (A;,). So we obtain: 


a +4 (Rank (A;,))' 


ifs => q. 
(3-4) 


Let finally an e > 0 be given. Choose aq = q(e) so that 


nk (4i)) Then (32) 


280-1 


gives immediately for all s => s that [[A;,]] < «. In other words: 
lim [[A;,]l = 0. 


and then an s = g) so, that 2 and 


Thus the proof is completed.— 

Before we go on, we observe this: Both Lemmata 1.4.1, 1.4.3 could have been 
easily strengthened by replacing the subsequence A;, , Ai, , --- by the sequence 
A,, Az, --- itself. But they suffice as they stand for our immediate purposes, 
and our ultimate results (Theorem III) yield those strengthened forms im- 
mediately. 

THEoREM III. (i) For every fundamental sequence A, , Az, --+ lim An 
exists, and belongs to SM. 

(ii) For two fundamental sequences A; , Az, --- and B,, By, --- the equation 


lim str A, = lim str B, 
holds if and only if the two sequences are equivalent. 
(iii) For every fundamental sequence A,, Az, --+ limn.,,, [[An]] (numerical!) exists, 
and it depends on lim str,..,, An only (and not on the sequence A, , Az, --- ttself!). 

Proor: We prove these statements in a somewhat changed order: 

Ad (i): Existence of lim strnz.,, An : Identical with Lemma 1.4.2. 
Ad (i): lim str,.,, An e MM: Clear, since Ai, Az, --- € MM, and MM being a ring, 
is closed in the strong topology. 
Ad (ii): Sufficiency: Let Az,--- and B,,Bz,--- be two equivalent 
fundamental sequences. Then A;, B,, Az, Bo, --- is also a fundamental 
sequence; denote it by C;, C:,---. By (i) above lim str,_,,, C, exists. Hence 
lim = lim strn.,, Con , that is lim str,.,, An = lim str», Bn. 
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Ad (iii): Existence of limy.,, [[An]]: Since Ai , Az, --- is a fundamental sequence, 
we have limmne [[4m — Aa]l] = 0. Now| [[Am]] — [[Aa]] | — A,l],” 
hence limMm,no ({[Aml] — [[Anl]) = 0. Therefore the (numerical) limit 
lim,» [[An]] exists. 
Ad (ii): Necessity: We may replace A;, --- and B,, Be, --- by Ai — Bi, 
A, — Bz, --- and0,0,---. Thatis: We may assume, that B,; = B, = --- = 0. 
So A,, As, --- is fundamental, and we assume that lim str,.,, A, = 0. Then 
Lemma 1.4.3 excludes that lim,.,, [[A,]] should exist and at the same time 
be 0. But we proved just now that part of (iii) which states that lim,.,, [[A »]] 
does exist. Hence lim,.,,[[A»]] = 0. Inother words: A;, --- and0, 0, --- 
are equivalent. 
Ad (iii): lim str,.,, An determines lim,..,, [[A,»]]: We have two fundamental 
sequences A,, Az, --- and B,, By, --- with lim str,.,, A, = lim str,.,, Bn, 
and we want to prove that lim,.,, [[A,]] = lim,.., [[B,]]. We proved just now 
the necessity of the criterion of (ii), hence A; , Az, --- and B,, By, --- are at 
any rate equivalent. That is: lim,.,, [[A. — B,]] = 0. But | [[A,]] — [[B,]] | 
< [[A, — (ef.® above), hence lim,.,, (([An]] — [[Bnl]) = 0. Consequently 
lima [[An]] = [[Ba]]. 

We can now define: 

DertniTion 1.4.5. The fact that Ai, Az,--- is a fundamental sequence 


and that lim str;.,, An = A, will be denoted by 
(Ai, As, WA, 


DEFINITION 1.4.6. We shall say that an operator A is “normed” if a sequence 
A, Ao, with (Ai, Ae, ---) ~ A exists. By (iii) in Theorem LUI, lim,., 
[[A,]] exists for every such sequence, and it depends on A only (not on the sequence 
A,, Ao, +++ itself!). We shall define the “norm” of A, the number [[A]], as this 
limit: [[A]] = lim,.,, [[Aal]. 

Remarks: (i) Every A of finite rank is normed, since we then have 
(A, A,--+) ~ A. This makes clear also that in this case—i.e., when the old 
[[A]] is also defined—the new [[A]] coincides with the old one. 

(ii) By (i) in Theorem III every normed A is ¢ DM. 
(iii) By (i), (ii) in Theorem III the relation 


(Ai, A2,---)~ A 


establishes a one-to-one correspondence between all equivalence-classes of 
mutually equivalent fundamental sequences Ai, Az, --- on one hand, and all 
normed A on the other.— . 

Throughout what follows we shall make use of the results of Theorem III. 
Definitions 1.4.5, 1.4.6, and of the above Remarks, without referring to them 
explicitly. 

"Always | ([A]] — [[B]]| [[A — B]]. Proof: = [[B +(A — B)]] + 


— B}], — [{B]] [[A — B]].  Interchanging A, B gives — [[A]] < [[A — B]]. 
Hence + ({[A]] — [[B]]) < {{A — that is | — | — 


w 
2 
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1.5. Lemma 1.5.1. (i) (Ai, Ag, ~ A implies (aA,, aAz,---) ~ ad 
for every complex a. 

(ii) (Ai, As, ~A, (Bi, Be, ~ B imply (Ai + Bi, Ar + Be, 
A+B 

(iii) (Ay, ---) ~ A implies (A1B, A2B, ---) ~ AB and BA2, ---) ~ 
BA for every B e M. 

(iv) (A1, Ao, ~ A is equivalent to (AT, Az, ---) ~ A* 

Proor: Ad (i): a@A;, @Az2, --- is fundamental along with Ai, As, --- because 
[[aAm — aAp]] = [[a(Am — Ax)]] = | a|-[[A» — A.J]. And lim str,.,, A, = A 
implies lim str,.,, dA, = @A. 

Ad (ii): A; + Bi, Ao + Be, --- is fundamental along with Ai, Ae, --- and 
B,, Bz, --- because [[(Am + Bm) — (An + Ba)]] = [[(Am — An) + (Bu - 
B,)]] [[Am — + [[Bx — B,J]. And lim str,.,, A, = A, lim str,.,, B, = 
B imply lim str,.,, (An + Bn) = A+B. 

Ad (iii): AiB, AoB, --- and BA,, BAg, --- are fundamental along with A;, 
Ag, --+ because 


[[BAm — BA,]] = [[B(Am — Ax)]] S ||| — Anal]. 


And lim strnz.,, An = A implies lim str,.,, A.B = AB, lim str,.,, BA, = BA. 
Ad (iv): Since ** is identity, it suffices to prove the forward implication. Now 
a. --+ is fundamental along with A;, Az, --- because [[Ax = 
[[(Am — An)*]] = [[Am — Anal]. Hence a B with lim str,.,, A, = B exists, and 
we know that we have lim strz.,.4n = A.’ Hence we have for every fe 
limn+. || Anf — Af || = 0, || Anf — Bf || = 0. So for all f, g © lim,..,, 
(Aaf,g) = (Af, 9), lime. (Ans, 9) = (Bf, 9). 
Replacing f, g in the first relation by g, f and application of © gives lim,.,, 
(Axf, g) = (A*f, g). Hence (A*f, g) = (Bf, g) for allf,g«$. Thus A* = B, 
and consequently lim str,.,, A, = A*. This completes the proof.— 

It is now easy to prove the fundamental properties of the class of the normed A 
and of their norm [[A]]. 

THEOREM IV. [[A]] (for normed A) has the characteristic properties of a norm 
inalinear space. Thatis: 
(i) Always [[A]] = 0, and = 07f and only if A = 0. 
(ii) aA is normed along with A, and [[aA]] = | a |-[[A]] for every complex a. 
(iii) A + Bis normed along with A, B and [[A + B]] s [[A]] + [[B]]. 
The following further statements are true: 
(iv) A* is normed if and only if A is normed, and {[A*]] = [[A]]. 
(v) AB is normed if either A or B is normed (but both « MM), and 


S ||| A ||| B ts normed, 


[[AB]] 
||| B|||-[[A]] A ts normed. 


7 lim stra.(Ax) = (lim str,.,An)* is not a generally valid equation in the sense that 
the existence of either side implies that one of the other side also, (Cf. (5), p. 382.) 
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(vi) If Ar, As, 8 regular sequence and A is normed, then (A; , Az, ~ A 
is equivalent to lity. — A]] = 0. 

Proor: Ad (i): Choose an (Ai, A2,---) ~ A. Then [[A]] = lim,., [[Aal]. 
since all [[4a]] 2 0, so [[AJ] 2 0. And = 0 means [[Aal] = 0, that 
is, the equivalence of Ai, Az, --- and 0,0,---. But as (Ai, As, ---) ~ A, 
(0,0, ---) ~ 0, therefore this means A = 0. 

Ad (ii), (iii), (iv), (v): Immediately by (ii), (iii), (iv), (iii) in Theorem III, 
respectively. (In the second part of (v) interchange A and B.) 

Ad (vi): Sufficiency: Assume (Ai, ---) ~ A. Clearly (—A,, ---) 
~ —A,,, hence by (ii) in Lemma 1.5.1. (Ai — An, A2 — An, --:) ~A— An. 
Therefore limm.,, [[Am — An]] = [[A — Aal]]. Now A;, As, --- must be 
fundamental, so limm,n—ro [[Am — An]] = 0. Since limm.,, [[Am — Anl] exists, we 
have a fortiori lim,.,, (limn—,, — A,]]) = 0, limy.,, [[A — = 0, that is 
lima [[An — A]] = 0. 

Necessity: Assume lim,.,, [[An — A]] = 0. By (iii) above, [[Am — Aal] = 
[[(Am — A) — (An — A)]] S [[Am — A]] + [[A. — Al]. Hence the above 
assumption implies limm,»,, [[Am — An]] = 0. So Ai, As, --- being regular, 
is even fundamental. Therefore a normed A’ with (A;, Az, ---) ~ A’ exists. 
Hence by our above result lim,.,, [[A. — A’]] = 0. Now again by (iii) above 
— A] = [[(An — A’) — (An — A)]] S — A] + — Al]. As 
lity [[An — A’]] = lim,.,, [[An — A]] = 0, this proves [[A — A’]] = 0. There- 
fore by (i) above, A — A’ = 0,A = A’. Hence (Ai, Az, ---) ~ A. 

Lemma 1.5.2. If (Ai, Ag,---) ~ A, (Bi, Be,---) ~ B, then lim,., 
To(AnBx) exists, and it depends on A, B only (not on the sequences Aj, Aa, --- 
and B,, Bz, --- themselves!). 

Proor: It suffices to prove thisforlim,..,, R79, (ABs) andlim,..,, 
separately. And since the latter obtains from the former by replacing A and 
the A, by 7A and the 7A, , so we need to consider lim,..,, RTO (A,B>) only. 

Now 


([[A, + B,J) — — 
= + Bn)(An + Ba)*) — An) — Ton(BaBa)) 
= (Toy (AnBr) + Ton(BnA;)) 
= (Ton + = 
(33) R Toy(AnBr) = + Ball’ — — 


We have (A;, Ae, ---) ~ A,(Bi, Be, ---) ~ B, hence by (iii) in Lemma 1.5.1, 
also (A, + By, As + Be,---) ~ A+B. So lim,., [[A.]] = [[A]], lim... 
([B.]] = lim,..., [An + Bul] = [[A + Consequently (33) gives 


lim RTO (AnBs) = + — — 


| 
| | 
| 
e 
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This completes the proof. 
DeFiniTIon 1.5.1. Let A, B be normed. By Lemma 1.5.2 lim,.,, 
exists for any two sequences A, , Az, --- and Bz, --- with (Ai, A2,---) ~A, 
(B,, Bz, ---) ~ B, and it depends on A, B only (not on the sequences A; , Ao, --- 
and B,, Bz, --- themselves!). We shall define the “inner product” of A, B, the 
number KA, B>, as this limit: KA, B> = limy..,, T9y(AnBa).— 

i Throughout what follows we shall make use of the results of Theorem IV 
4 and Definition 1.5.1, without referring to them explicitly. 

Be Taxorem V. (i) «A, B>> is linear in A, conjugate-linear in B, and of 
Hermitean symmetry in A, B. That is 


«aA, B> = B>, 


B>, 


KA' + A", B> = + 


«A, aB> = B>, 


<A, B’ + B’> = «A, + «A, B’>, 


KA, B> = KB, A>. 


«KA, A> = 


«KAD, B> = «A, BD*>, 


«DA, B> = «A, D*B>, 


where A, B are normed and D ¢ M. 
Proor: All these statements are immediate by Definition 1.5.1, and the 

properties of T_(X) for X of finite rank. (Remember, in particular, that 

Toy(XY) = Toy (YX) if either X or Y has a finite rank, and both are ¢ 9il.)— 
We are now in a position to prove: 

Lemma 1.5.3. A projection E is normed if and only if it is finite. 

| Proor: Sufficiency: If E is finite, then it is of finite rank, hence it is normed. 

; Necessity: Assume that EZ is normed. We want to prove that it is finite, 

that is: If F e Mis a projection < H, with F ~ E ( --- MM) ((1), p. 151, Defini- 

4 tion 6.1.1), then F = E ((1), p. 155, Definition 7.1.1). So we assume that F 

has the above properties (excepting of course F = E). 


F ~ E(--- ON) means F = U*U, E = UU* for a partially isometric U ¢ ML 


e ((1), p. 151, Definition 6.1.1), that is a U with UU*U = U, U*UU* = U* 
: ((1), p. 142, Lemma 4.3.2). Then U = UU*U = EU and F = U*U are 
both normed, along with E. F < E means FE = F. 


N 
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Now we obtain, with the help of (i)-(iii) in Theorem V: 
KE, F> = «BE, F*F> = «FE, F> = «F, F> = (IFIF, 


«F, E> = «KE, F> = [IFIP = (IFIP, 
- = KE F, E- F> | 
E> + «F, F> — «E, F> — E> = [[E]) — 
(34) — = — (FIP. 
Also: 
(EI? = <E, E> = «KUU*, UU*> = KUU*U, U> = «KU, U>, 


(FI! = «F, F> = «KU*U, U*U> = KUU*U, U> = «KU, U>, 


(35) = [[FIP. 


Combining (34), (35) gives [[Z — FJ’ = 0, [[Z — F]] = 0, and consequently 
E=F., 

Thus the proof is completed.— 

After this we could determine the normed A completely by investigating the 
spectral form of A*A and the projections of its resolution of unity E(A), —° < 
4<-+2. Weshall not do this here, however, because for our further purposes 
(especially for Lemma 2.2.1) Lemma 1.5.3 is sufficient. 


1.6. We want to say a few words about the meaning of our above notions of 
normedness and norm in factors 9 of various classes. The contents of this 
section, however, are not needed for the understanding of our subsequent 
deductions. 

(A) MM cs of class (I,) (n = 1, 2, --- ) or (Ili): MM is in this case of a finite 
class, hence every IN OM is finite, every A ¢ OM has finite rank. To(A) is 
defined for all A ¢ 9M and coincides with the relative trace TQy(A). (We can 
choose M@ = H in Definitions 1.2.3, 1.2.4, and D(M) in the standard normaliza- 
tion.) Hence every A is normed, and [[A]], «A, B> mean the same thing as 
in (2), p. 241, Definition 4.3.1, and Lemma 4.3.2. 

(B) OM is of class (I,,): We might as well take DM as the ring 8 of all bounded 
operators in §. (Cf. (1), p. 173, Lemma 8.6.1.) Let gi, --- be a complete, 
normalized, orthogonal set in and represent every A « Dl = & by its matriz: 


A = {aj;}, i,j = 1,2,---, 


ai; = (Agi, 


t 
| 
3 
Be 
_ 
j 
i 
‘> 
| 


Choose D(Q) in the standard normalizations; then it is the common notion of 
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dimension. (Cf. loc. cit. above.) Hence Rank (A) has its usual meaning too. 


We show now: 

B.1. (i) If Rank (A) is finite, then ><? ai is absolutely convergent, and 
TRA) = aii. 
(ii) If Rank (A) is finite, then ai; |? is absolutely convergent, and 

PRroor: ‘Ad (i): If Rank (A) is finite, then Rank (A) = D ([Range A]) = k = 
0, 1, 2,---, hence [Range A] = --- , where --- , are a nor- 
malized orthogonal set. Put = 1 = 1,---,k, then Prrange a) = 
Ei + --- + Ex, 80 A = Pirange = + --- + Each has at 
most rank | so it suffices to prove (i) when the rank is at most 1. In this case 
clearly A= {a:;}, aij = a8; . If all a; = 0 or all B; = 0, then all ai; = 0 
in which case our statement is trivially true. Hence we may assume that some 
a; ~ 0 and some ~ 0. As every aj; |? and every | ai; |* must 
be finite,® this implies that both }°?., | a; |’ and >°3-1 | 6; |’ are finite. So we 
have: 


0< Ja P, < +o. 


This proves in the first place that }°7.1 ai: = 071 a8; is absolutely con- 
vergent (by Schwarz’s well-known inequality). Now put B = {b;;},C = {c,;}, 
where 


7\=0 otherwise 


0 otherwise 


Then BC = A, CB = {d;;}, where 


aB; for i,j = 1) 
dj; 1=1 
= 0 otherwise 


So CB = where Pi,,;. B, C are clearly of finite rank (= 1). 
Now we have (remember D(F;) = 1, hence Toy (£1) = 1): 


Ton(A) = Pon (BC) = = ( a6.) Tom (ts) = 


Thus the proof is completed. 

Ad (ii): Immediately by (i), since A*A = {giz}, giz = , hence 
gu = | Ji = Dis | ax; | 

B.2. A is normed if and only if ai; |’ is finite. Again [[A]P = 


| 


= | Aes, = Age IP, 80 | ai; |? is finite. The same 
results for | |* by considering A* in place of mn = {aj;}. 


He 
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Proor: Necessity: Assume that A is normed. Put E, = Pig,,....)- Then 
|| = 1, hence Z,AZ, is normed, and [[A]]. Now £,AE, 
has even finite rank, along with E,, and A ~ {a,;} implies E,AE, ~ {aj}?}, 
where | 

(n) ) = for 1,++-,n 
{= 0 otherwise 


Hence by (ii) in B.1: 


and consequently >-}je1 | as; |’ < [[A]]’ for every n = 1,2,---. This proves 
the finiteness of | aj 
Sufficiency, and value of {[A]J’: Assume that 7; | a;; |’ is finite. Form E, 
as above, put A, = E,AE,. Each A, has finite rank, and ||| A, ||| = 
||| ||| < ||| A |||. Hemee the sequence Ai, Az, --- is regular. It is 


also fundamental: If m = n, then Am — A, = {as7” — a{}?}, and 


al” — as” = aij for i,j = 1, ---,m, but not 7,7 = 1, ---,m 
i \=0 otherwise 


and consequently (using (ii) in B.1) 


— Aal? = las? af P= - | af 


i,7=1 


= 
So for m = n 


and so the finiteness of | |’ implies [[Am — Anl] = 0. 
Finally lim str,..Z, = I, hence lim sttn.An = lim stf.E,AE, = A. 
Thus we have 
(A;, Ae, ) ~A. 


Therefore A is normed, and (using (ii) in B.1) 


= lim [[A, J = lim >> = lim = 
ne i,j=1 i,j=l 

Thus the proof is completed.— 

B.2 shows that in this case the class of all normed A coincides with the well- 
known class of E. Schmidt. 

We finally observe that in this case the condition limm,n+« [[Am — Anl] = 0 
of Definition 1.4.2 (which implies the existence of limn.. [[An]] hence the 
boundedness of the sequence [[A1]], [[As]], --- ) implies (iii) in Definition 1.4.1. 
So in the present case (I,,) condition (iii) in Definition 1.4.1 is superfluous— ; 
which is not true in the case (II,,), as will be seen in C.1 below. i 


fi 
| 
| 
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Indeed we have: 

B.3 ||| A ||| < [[A]] of A ts normed. 

Proor: That is: || Af || < [[A]]-||f|| for every fe. We may obviously 
assume || f || = 1, and then we can choose the complete, normalized orthogonal 
set g1, ¢2,--- so that g: = f. So we want to prove || Ag; || < [[A]] that is, 
|| Ag: ||’ [[A]?. This, however, is obvious.” 

(C) ON is of class (II,,): In this case our notions are genuinely new. We 
only wish to observe that the opposite of B.3 is true, which justifies our formula- 
tion of Definitions 1.4.1, 1.4.2. (Cf. the above discussion before B.3.) Indeed 
we have: 

C.1. Even for an A of finite rank ||| A |||/[[A]] can be arbitrarily great. 

Proor: Given any C > 0, choose a projection E e 9 with D(Z) > 0, < 1/C”. 
Then E is finite, hence of finite rank. ||| Z ||| = 1, [[Z]] = (D(2))' s 1/C, 
and consequently ||| |||/{[Z]] = C. 

(D) ON is of class (III,): If A is of finite rank, then [Range A] is finite, 
hence (since 9 is of class (III,,)!) [Range A] = 0, A = 0. Hence for every 
regular, and a fortiori for every fundamental sequence A;, Az, --- necessarily 
A; = Az = --- = 0. Thus the only normed A is A = 0. 

In this case, therefore, all the notions we have defined are vacuous.— 

In spite of this, just these notions will help us to prove that the examples of 
factors which we shall construct in §§4.3-4.4, belong in certain cases to class 
(III,,). (Cf. in particular §4.3, Theorem IX.) The procedure will be essen- 
tially the reverse of (C) above: We shall succeed in showing that in those 
factors Dil every finite projection Z must be = 0, the proof being mainly based 
on the properties of the notion of normedness. From this we then conclude 
that such an 9 must be of class (III,,). Any direct attempt to prove such a 
thing will show that of all possibilities alternative to (III,,) the case (II,,) is the 
hardest one to exclude. It is therefore very appropriate that our notions of 
normedness and norm are in this case genuinely new. 

We also want to point out, that Theorems IV, V show, that the normed 
operators in 9 form a (possibly) incomplete Hilbert space. This space is 
actually complete in the following cases: (I,): Obvious. (I,): Clear by (B) 
above. (III,,): Clear, since it contains 0 only, cf. (D) above. In the cases 
(II,), (II,,) it is not complete, but can be completed to a space Q(ON), by 
adjoining to MN certain (but not all) unbounded operators 7 9. This process 
was carried out in detail for (II,) in (2), pp. 236-242. In this connection C.1 
above is significant. We will not go here any further into this subject. 


Cuaprer II. Maximum Rincs THE Operations 


2.1, DEFINITION 2.1.1. For any bounded operator A and any projection E 
we define : 
A'* = EAE + (I — E)A(I — EB). 


9 || Ag: ||? = | ai; |? by footnote 8, [[A]]* = | ai; |? by B. 2. 
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Consequently 
A —A'* = (I — E)AE + EA(I EB). 


Lemma 2.1.1. (i) If A, Ee 9M then eM. 
(ii) If A, EM and A is of finite rank, then A'* is also of finite rank. 

Proor: Both statements are obvious. 

Lemma 2.1.2. A!” commutes with E. 

Proor: Results by an obvious computation. 

LemMA 2.1.3. If Ei, ---, EH, commute with each other, and if (m, +++ , mn) 
is a permutation of (1, --- , ), then 


Proor: It is obviously sufficient to consider the case where (m, --- , tn) is 
just a transposition of two neighbors, say of m, m + 1 (m = 1,---,n — 1). 
Then we must prove nut B= and apply 
i¥m+2l--"l¥m to the result. Writing A, E, F for B, Em, Ems: we see: We have 
to show 

Alsi? — 


if Z, F commute. This, however, results by an obvious computation. 

Corottary. If Ey, ---, En commute with each other, then com- 
mutes with all of them. 

Proor: Consider ani = 1,---,n. Let (am, ---, mn) be a permutation of 
with = 7. Then by Lemma 2.1.3, and 
this commutes with Z,, = EH; by Lemma 2.1.2. 

Lemma 2.1.4. ||| = A 

Proor: We shall prove ||| < ||| From this 
the desired result obtains by putting m = 1,---,n. Write in the above 
inequality A, E for #, Then we obtain ||| A!” ||| < ||| A ||. 
In other words: We want to prove 


Now clearly ; 
(A'"f, 9) = (EAEf, g) + (I — E)A(I E)f, 9) 

= (AEf, Eg) + (AU — Bf, (I — E)g). 
Hence 
87) | (ANY 9) SNA Bo + @ — |). 
By Schwarz’s well-known inequality 


Bf\|-|| Eo || + || @ — @ Bg || 


t 
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But 
= (Ef, f) + (2 — Ef, f) = = Ils if, 
similarly 
|| Zg |? + || — Bgl? = 
Therefore 


And this carries (37) over into (36), and thereby completes the proof. 
Lemma 2.1.5. Assume that A, E,,--- €M, A of finite rank, the E,, 
E,, --- commute with each other. Then 


ifm Sn, m,n = 1, 2,---. 


Proor: All A!”!!*?!"" are of finite rank by Lemma 2.1.1, (ii), so all quantities 
which occur in the above equation are defined. We want to prove 


(38) (LX + Y]P = + (VIF 
Now 


(xX | 
= KX, X>+ «<Y, YO + «X, YO + X» 
= [[XIP + + 2R«xX, Y>, 

hence (38) is certainly a consequence of 

(39) KX, YO = 0. 

We shall prove (39) for the above X, Y. And since 


, 
so we may prove (39) for X = yp — _ 
withp <np+1 sn. 

Let (m,---,mn,) be a permutation of (1,---,n) with mm = p + 1. 
Write B, C, E for then we see: We need to 
prove (39) for X = B'*, Y = C — C'* only. By Definition 2.1.1, B'® = 
EBE + (I — E)B(I — E),C — C'* = (I — E)CE + EC(I — E). Hence we 
need to prove (39) only for the following combinations of X, Y: 


ir = EBE or (I — E)B(I — B), 
Y=(W-H)CE or EC(I — 


(40) 


ex 


(i 
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Now we have for the various alternatives of (40) 


a1) FX = X,FY=0 for F=E or I-E, 


Owing to (41) we now have: 


Either «X, Y> = «FX, Y> = «KX, FY> = 0, 
or «X, Y> = «XF, Y> = «X, YF> = 0. 


Hence (39) holds in any case. 

Thus the proof is completed. 

Lemma 2.1.6. Let A, Hi, Es, --- be asin Lemma2.1.5. Put A, = 
forn = 1, 2,---. Then Ay, Ae, --- a fundamental sequence. 

Proor: Ai, Az, --- is a regular sequence: We must verify the conditions 
(i)-(iii) in Definition 1.4.1. 
Ad (i): Obvious. 


Ad (ii): Immediate by repeated application of Lemma 2.1.1, (ii). Ay 


Ad (iii): Immediate by Lemma 2.1.4. 
A,, Az, --- is a fundamental sequence: We must verify further the condi- 
tion of Definition 1.4.2, that is 


(42) lim [[An — Aal}’ = 0. 


Now Lemma 2.1.5 gives [[Am]]° = [[Aal]’ for m S n, 


= = --- = 0. 
Therefore lim,_..» [[A,]]° exists. Hence 


(43) im | [[Aal} | = 0. 
But 


Indeed: By symmetry in m, n we need only to prove (44) for m < n, and then 
it follows from Lemma 2.1.5. 

Now (48), (44) give together (42). : 

DurinitTion 2.1.2. Let A, Ei, E2,+-- be as in Lemma 2.1.5. Then there 
exists by Lemma 2.1.6 a unique A, so that 


(A,, As, +--+) ~ A, where Aq = 
We shall write 
A = 


Tazorem VI. Let A, E,, E2,--- be as in Lemma 2.1.5. Then we have: 
(i) normed. 
(ii) commutes with E,, Es, 


or XF =X,YF=0 for F=F or I[-E. | 


| 

m,n 

‘ 

if 
; 

& all 
i 
| 

az 


122 J. V. NEUMANN 


Proor: Ad (i): Obvious by Definition 2.1.2. 
Ad (ii): Consider a fixed i = 1,2,---. Forn 2% A, = A!™!'"''*= commutes 
with E; by the Corollary to Lemma 2.1.3. But A!™!!*'"" = lim strs.,, A, by 
Definition 2.1.2, hence A'*''”*'"* commutes also with E; . 


1 | 2.2. DerIniTion 2.2.1. A ring 2C OM will be said to be “purely infinite” 
: ; uf every projection E ¢ & is either = 0 or infinite (with respect to M, ef. (1), p. 155, 
Definition 7.1.1).— 

| ( Observe that the following terms: finite, infinite (ef. loc. cit. above) of finite 
M rank (Definition 1.2.1) normed (Definition 1.4.6) are always to be understood 
; with respect to MN. Being purely infinite, however, is defined in Definition 
2.2.1 above for any ring 2C Ml. But it has, of course, for L = Mi the old 
meaning of 9 being purely infinite ((1), p. 172, Theorem VIII): That 9 
belongs to class (III,,). 

We prove now: 

Lemna 2.2.1. A ring 2£C OM is purely infinite, if and only if the only normed 
AceLisA = 0. 

Proor: Sufficiency: Assume that the only normed A e£ is A = 0. Apply 
this to projections He &. They must be = 0 or infinite. Hence £ is purely 
infinite. 

Necessity: Assume that £ is purely infinite. Considera normed A e&. Then 
A*A is also normed and also ¢ & along with A. 

We now proceed as in the proof of Lemma 1.3.2, and obtain the spectral 
form of A*A, in particular the equations 


6) (Aaj, 0) = 0) 
@ = [radi Boor 
(8) 4 <0 implies E(A) = 0 


where E(A), ~2 <A < +~, is a resolution of unity. (Cf. loc. cit. above.) 
Consider now a \» > 0. Form the bounded function 


We can form the operator B = a(A*A) which is characterized by 


(45) (Bf, g) = a(n) d(EQ)f, g) 
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((6), pp. 202-205), and which belongs to £ along with A*A ((6), p. 213, Theorem 
§—only the easy, sufficiency part of this theorem is needed). Now B-A*A = 


g(A*A), where 


=1 for 
= 0 otherwise 


= ra(n) 
((6), pp. 205-206, property e) on p. 205); hence 
(B-A*As, = BO) = EOI), 0 


((6), pp. 202-205), and so 
B.A*A =I — 


B, A*A belong both to £ and A*A is normed; hence B.A*A = I — E(Xo) 
belongs also to £ and is also normed. So we have shown: 


(46) implies E(A) = I. 


(8), (46) and (7) give together || Af ||’ = 0, Af = 0 for all fe. Hence 
A=0. 

This completes the proof. 

Lemma 2.2.2. Consider an Abelian ring @ C OM. There exists a sequence of 
projections E, , E,,--- eQwith@ = R(E,, Ee, ---). (Cf. (5), 401, lower part 
of the page.) Since ct is Abelian, these E,, Ez, --- commute with each other. 
So we can form A'**!*2|""- for every A of finite rank (Definition 2.1.2)."° We con- 
sider and E,, Ex, --- as given. 

We make the following assumptions: 

(i) The ring M-A’'(C M) ts purely infinite. 
(ii) For every finite projection F(¢e MM) 


Q implies F = 0. 


Then M is of class (III,). 

Proor: Let F(e 9M) be a finite projection. Then we can form F'*!¥#!""-, 
PII": 3s normed ((i) in Theorem VI), and so in particular «9M. Further- 
more F'*!!*:|"-- commutes with E,, Es, --- ((ii) in Theorem VI), hence with 
all of @ = R(E,, Ez, --- ) and so it is Thus Now 
F'*|*:I--- ig normed and ON-@’ is purely infinite by (i), hence Lemma 2.2.1 
gives F'*:'*2|" — Q. And (ii) permits us therefore to conclude that F = 0. 

In other words: A projection F ¢ 9M is either = 0 or infinite. Hence 9M is 
of class (III,,).— 

The above Lemma 2.2.2 is the device by which we are going to establish 


We could show that A!#1!#2!--- depends only on A and (@ (and not on the sequence 
E,, Ez, -+- itself). We omit the proof, because this fact is immaterial for our further 
purposes. 
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i that some of our examples Ii are of class (III,). (Cf. Lemma 4.3.5.) We 
j shall apply it to rings @ which are = 9-1’, so-called maximum Abelian rings 
in SM. The general theory of such rings is of some interest, but we do not 
propose to go into it in this connection. 


Cuaprer III. Construction OF THE Rines ON, 


3.1. This chapter will be devoted to the construction and investigation of a 
family of examples of factors 9 in appropriate Hilbert spaces §. Some of 
these SI will be shown to be of class (III,,). More precisely: We shall succeed 
in determining the class of each 91 of this family, and we shall see that all 
classes (I,)—(III,,) occur among them. 

The examples which we shall consider are a generalization of those, of (1), 
pp. 192-209, which belonged to the classes (I,,)—(II,,). We desire, however, to 
treat the intervening notion of (generalized) Lebesgue measure in a slightly 
different way than was done loc. cit. above. Furthermore, this part of our 
iy discussion will have to be made on a broader basis than was necessary there. 
For these reasons we begin with a discussion of the properties of (generalized) 
Lebesgue measure and integration in an arbitrary space S. 


3.2. DEFINITION 3.2.1. Let S be a fixed set. A system T of subsets of S will 
be called a ‘“‘Borel-system’’ if it has the following properties: 


(i) © (the empty set) « T. 
(ii) M eT implies S — M eT. 
(iii) M,, M2, eT imply M, + Me + er 


| : i Lemma 3.2.1. For a Borel system T in S we have further: 


(i) 
(ii) M,N eT imply M — 
(iii) M,, Me, eee eT imply er. 


Proor: We prove these statements in a somewhat changed order: 


Ad (i): By (i), (ii) in Definition 3.2.1, since S = S — 0. 
Ad (iii): By (ii), (iii) in Definition 3.2.1, since 


= 8 ((8 — My) + (S — 


Ad (ii): By (ii) Definition 3.2.1 and (iii) above, since M — N = M-(S — N). 


! ke DEFINITION 3.2.2. Let A be an arbitrary system of subsets of S. Then there 
f exists a minimum Borel-system T D A in S,” which will be denoted by T'(A). 


1 The intersection of all Borel-systems [ > A in S. 
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Deriniti0n 3.2.3. A Borel-system 1 in S will be called “‘separable,” if th 
exists a system A of subsets of S with the following properties: 
(i) T= 
(ii) fs, then x = y. 
(iii) A is finite or enumerably infinite .— 


We can now define our notion of (generalized) Lebesgue measure. (To be 
called I'-measure, ef. below.) 

Derinition 3.2.4. Let a separable Borel-system T be given in S. A function 
u(M) will be called a “‘T-measure’’ if it possesses the following properties: 


(i) (M) is defined for all M €T and only those. 
(ii) The values of u(M) are real numbers 2 0, S + ~. 
(iii) There exists a sequence + 7? + = §, 


such that all <+o., 
(iv) If Mi, M2, --- eT andi ~ j implies M;-M; = then 
u(M, + Me +---) = w(Mi) + + --- 


We shall always consider measures with u(S) = 0. 
DertNiTION 3.2.5. A complex valued function f(x), defined for all x « S, will 
be called “‘T-measurable,”’ if the sets 


(t;Rf(z) >a), (x; Sf(z) > a) 


are eT for every real a.— 

It is not necessary to develop the properties of the class of all '-measurable 
functions in detail since they are the same and obtained in precisely the same 
manner as those of the class of all Lebesgue-measurable functions of one real 
variable. (Cf. any standard exposition of that subject. For a systematic dis- 
cussion of such function classes as ours, cf. e.g. (0), pp. 232-247. Our I'-measur- 
able functions f(x) are to be characterized in the terminology loc. cit. as follows: 
K(x), Sf(x) are both functions of class (I, *) or, which is equivalent in this 
case, of class (T, T).) It may suffice to observe that the following functions in 
particular are '-measurable: 

(i) The function Ou(zx) { 
(ii) af(x), f(z) along with f(x), a being any complex constant. 

(iii) f(z) + g(x), f(w)g(x), Max (f(x), g(x)), Min (f(z), g(z)) along with f(z), g(2). 
(iv) vi = limn.,,fa(x) (if this limit exists for all z¢S) along with f,(z), 


For the I-measurable functions f(z) we now define the notion of I’, u-tntegra- 
bility (with respect to the T-measure yu!) and the numerical value of the I’, u- 
integral 
(1) (M eT) 

M 


of any M eI. 
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in the usual way. (Cf. any standard exposition of Lebesgue-integration. The 
observations of (1), p. 194, top of that page, apply to the present situation.) 
There is again no need to enumerate the well-known details. Let us mention 
this, however: For a general, complex valued, f(x) we restrict, as usual, (47) 


to the case where f(z) is even T, u-summable over M that is, where il | f(x) | dye 
M 


exists and is finite. In such situations, however, where f(z) is restricted to real 
and = 0 functions, we shall consider (47) as meaningful for all I'-measurable 
f(x), expressly permitting in this case the value + © also for the integral. This 
convention too agrees with the general usage. 

Throughout the rest of this section, as well as in §§3.3-3.7 which follow, we 
assume that S, I w are given, and also that [is a separable Borel-system. 
Other I'-measures », »’, v’’ and groups §, which will be considered, may vary. 

We construct next a functional Hilbert space over S. 

DEFINITION 3.2.6. Let Ss = ODs,r,u be the set of all (complex) valued | T'-measur 


able functions f(x) (a € S) with a finite [ | f(x) |? dyx. Consider f, g as identical 


if f(x) = g(x), except for an x-set of u-measure 0. Define 


Lemma 3.2.2. The space sz fulfills the postulates A, B, C, E of (4), pp. 64-66. 
(This includes the existence and finiteness of the integral which defines (f, g)s for 
f,g€9s.) Thus Os is a finite-dimensional Euclidean or a Hilbert space. 


Proor: Existence and finiteness of [ f(x) g(x) d,x: Literally as in (4), 
8 


p. 109. 

Ad A, B, E: Literally as in (4), pp. 109, 111. 
Ad C: Same as in (4), pp. 109-111, except that the system of neighborhoods in 
which occurs there (Q is a metric space which plays the réle of our S), is to be 
replaced by the following sets in S (all ef): T°7;, i, 7 = 1, 2, ---, where the 
T”, T®,... are the sets from (iii) in Definition 3. 2.4, saa the 7,, T2,--: 
are all sets which obtain by any (finite) number of Nanloations of the operations 
M —N,M +N, M-N to the elements of a finite or enumerably infinite basis 
A of I in the sense of Definitions 3.2.2, 3.2.3.— 

It may be well to recall at this stage the examples a) and .b) of (1), p. 193. 
(The sets T°, T®, ... given there serve both for (iii) in Definition 3.2.4, and 
to construct A, bi A = (T°, T®,...), = T(A). T turns out to be in 
the example a) ‘the system of all sub-sets of S and in the example b) the system 
of all relative Borel sets in S in the usual topological sense.) The example a), 
with a finite set S = (2, ---,2%,) shows in particular that 6s may actually 
be a finite-dimensional Euclidean space. 

We conclude this section by proving the “differentiation-theorem” of 
Lebesgue-Nikodym (Theorem VII below). It would be possible to carry out 
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most of what we want also without this theorem, but such a procedure would 
necessitate qualifications and restrictions in the hypotheses of the lemmas and 
theorems which follow. 
Lemma 3.2.3. Consider two T-measures u, v in S. Then we have: 
(i) If »(M) = 0 implies v(M) = 0 then there exists a T-measurable function 
x(t) 2 0, such that for all T-measurable (complex) functions f(x) and all sets M «T 


(This is meant to include that if either side is defined and finite, then both are.) 
(ii) The x(a) of (i) ts unique, up to x-sets of u-measure 0. This is even the case 
if (i) is restricted to f(x) = 1, M eT. 

(iii) The equivalence of u(M ) = 0 to o(M) = 0 is equivalent to having always 
x(x) > 0 in (i), except for an x-set of u-measure 0. 

Proor: Ad (i): Consider the T”, T®, .-. of (iii) in Definition 3.2.4 first 
for p: T, TP, --- and then for »: T3”,---. Let Te”, Ts”, --- be the 
oF, i j = 1, 2,---, arranged as a sequence. They will then do for’ the 
T-measure defined by 


(48) p(M) = + »(M) (M eT) 
in the sense of (iii) in Definition 3.2.4. The same is true for 7'§0’, TS, --- 


if we define Ty = Ts? — (To? +--- + To”), i = 1,2,---. And these 
have the further property that + j implies = 0. 
Let us form 


(49) Lif) = f(x) d,x for any fe Ds,r,p- 
00 
(Observe: v in the integral, but p in Os,r,,!) Since 
and 
/ Pda = ro(Tii) 


are both finite, the integral in (49) is (by Schwarz’s well-known inequality) 


existing and finite, and even 
4 
| Lf) | = f(z) | f(x) | 
< 


(50) | Lif) |S (C; = (Too). 
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Li(f) is by (49) a conjugate linear functional in $s,r,, , and by (50) a continuous 
one. Hence a lemma of F. Riesz applies to it (cf. e.g. (4), p. 94, footnote 52): 
There exists a x; € Hs,r,, , such that Li(f) = (x; , f) that is 


| i Consider now a I'-measurable f(z) = 0, which is = 0 for x ¢ 1's0’, and bounded 
in S. Then f  Ss,r,, (because is finite!), and the above equation may 
be written 


Since it does not matter in (51), what f(x) does for x ¢ Ts we can drop the 
requirement of f(x) = 0 for x ¢ Tv. 
Taking the real part of both sides of (51) replaces (since f(x) is real) «;(z) 


( by 9x;(z). Hence we can assume that «;(x) is real. Now put 
= 1 forxzeM 


i first with M = M; = T§)-(a; xi(z) < 0), and then with M = MY = 
! Too - (x; x:(t) = 1). In both cases f(x) meets the requirements of (51). The 
left side of (51) becomes = 0 and S 0 respectively, and so we obtain 


k(x) d,x = 0, / k(t) <0. 
Mt 


But «’(z) < 0 for all x and x’(x) > 0 for all x M7; hence necessarily 

«| u(M;) = u(M7) = 0. This also implies »(M;) = »(M?) = 0. Hence we can 
Li change «;(z) arbitrarily in M; + M; without affecting the validity of (51). 
We redefine «;(x) = 0 for re M; + M7. So we have 


(52) 0S x(x) <1 (xe TW). 


So the integrands on both sides of (51) are = 0. Therefore if f(x) is merely 
I'-measurable and = 0, we can put 


fale) f(z) for f(a) sn 


= 0 otherwise 


replace f(x) by f(x) in (51), and apply lim,.,,,: Then (51) for the original f(x) 


— 


fe results. In other words: (51) holds for all T-measurable f(z) = 0. (Of course 
‘ a then both sides of (51) may be infinite.) 
Now put «(z) = Then (52) gives 
1 — x;(z) 


(53) 2 0 (x Too). 


W 
f(x) d,z = d,x = x(x) dyx + 
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We can replace f(x) in (51) by res ; then (51) becomes: 


(54) f(x) d,x = f(z) ki(x) d,z, 


Too 


again for all T'-measurable f(z) 2 0. Now define «(z) = «,(x) for xe Te, 


i=1,2,---. Then (53) gives 


(55) k(x) 2 0 (x 8). 
And adding (54) over all 7 = 1, 2, --- gives 
(56) [ ax = f fle) dye. 


Consider now any I'-measurable (complex) f(z). Apply (56) to | f(x) | (which 
is 2 0): It shows that if either of i | f(a) | dear, [ | f(x) | x(z)dx is Ginite, 


then both are, That is (f(x) is complex!): If either of [ [ (2)d,2 


is defined and finite, then both are. Combining this with the validity of (56) 
for T'-measurable f(z) = 0 shows: (56) holds for all [-measurable (complex) 
f(z), this being meant to include, that if either side of (56) is defined and finite, 
then both are. 
Now define f’(x) Then (56) becomes (in the 
same sense as above) 


And (55), (57) complete the proof of (i). 

Ad (ii): Assume that x«’(x), «’’(z) both meet the requirements of (i), with the 
restriction made in (ii). Consider T°, T, --- of (iii) in Definition 3.2.4 for », 
and put Mj, = T.(x; «’(4) < x(x) Sn). Consider the function f(z) = 1 
and M = M;,,. Then we have »(Mj.,) = = K(x) dye. 

M 


Since (Mi) is finite, we can infer — = 0. 


But «’’(x) — «’(z) > 0 for hence we have u(M;,,) = 0. Put M’ = 
Mi = (a; < «’’(x)), then this gives 4(M’) = 0. By symmetry in 
x'(x), we have similarly for M” = (x; «’(z) > «’’(x)) = 0. And 
sofor M = M’ + M” = (a; «'(a) ¥ «’’(x)) again u(M) = 0. Thatis: «’(z) = 
«(x), except for an x-set of u-measure 0 (namely: M). 

Ad (iii): Sufficiency: Assume «(x) > 0, except for an x-set of u-measure 0. 


»(M) = 0 implies [ x(x)d,2 = 0, and since «(x) > 0 for x « M (except for an 
M 
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z-set of u-measure 0), so u(M) = 0. Hence u(M) = 0 and »o(M) = 0 are 
equivalent. 
Necessity: Assume that u(M) = 0 is equivalent tox(M) = 0. Put 


= lforre M 
= Ou(z) 0 otherwise |’ 


= (x; x(x) = 0). Then we obtain »(M) = x(x)d,x = 0, hence u(M) = 0 


(by the above assumption). So we have x(x) > 0, except for an 2-set of u- 


DEFINITION 3.2.7. We oe the unique (up to x-sets of u-measure 0!) x(z) 
of (i), (ii) in Lemma 3.2.3 by & “ (x): 
TuHroreM VII. Letybea T-measure in S. Then the relation 


x(x) 


= 


establishes a one-to-one correspondence between all T-measurable functions x(x) = 0 
(up to x-sets of u-measure 0!) on one hand, and all T-measures v, for which u(M) = 0 
implies v(M) = 0, on the other. The inverse relation is 


»(M) = 


Proor: Given »(M), as described above, the existence and uniqueness of this 
x(x) was already stated in (i), (ii) in Lemma 3.2.3 and in Definition 3.2.7. So 
we must only consider the case where x(x), as described above, is given. 

Consider the T°, T®, .-- of (iii) in Definition 3.2.4, and let T§?, T”, --- 
be the 7. (x; x(x) < n), i, n = 1, 2, --- , arranged as a sequence. They will 
then do for the !'-measure defined by 


(58) vy(M) = k(x) d,x (M 


in the sense of (iii) in Definition 3. 2. 4. It is also clear by (58) that u(M) = 0 


implies y(M) = 0. Hence we can form Z (x). 


Put f(z) = 1, then we have »(M) = / d,z = [. . (x) dx. So (58) gives 


(x) d,x = x(x) dx (M eT). 


Hence by (ii) in Lemma 3.2. 3g = «(z), except for an x-set of u-measure 0. 


This, together with (58), aS. the proof. 
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(i) = 1, 
(i) (x) of = 0 implies = 0, and v(M) = 0 implies 


p(M) = 0. 
Proor: (i), (ii) are both immediate on the basis of the characterization of the 


‘ (x) in (i), (ii) in Lemma 3.2.3. 


3.3. Let S, T, » be as in the preceding section. We now describe a certain 
relationship between S and a group §. 

DeFINITION 3.3.1. ts an S-group if it has the following properties: 
(i) $ is a “group,” that is, a “composition” aB, an “inverse” a‘, and a “unit” 1 
are defined in § with these properties: 


(aB)y = a(By), aa! = = 1, al = la =a. 


(So § is “associative,” but not necessarily ““commutative.”’) 
(ii) Every a ¢ S defines a certain one-to-one mapping of S on itself: 


(x, ra 8). 
These mappings “represent’”’ G, that is 
= (x eS, a, 


(iii) is finite or enumerably infinite — 

One sees immediately (by (ii) above) that 21 = zx and that x @ za‘" is the 
inverse mapping to x = za. 

We shall, as usual, denote by Ma the set (ra;x «M) (M, Mac S). 

DEFINITION 3.3.2. An S-group is an “S, T-group” if for everyaeS M eV 
implies Hence if aT-measure in S, then the defined by 


Ha(M) = 

is also a T-measure in S. 

DEFINITION 3.3.3. An S,T-group an ‘“‘S,T, u-group” (u aT-measure in 8), 
if for everyaeS p(M) = 0 implies u(Ma) = 0. 

Lemma 3.3.1. Let Gbean u-group. Then we have: 
(i) u«(M) = 0 and u.(M) = 0 (that is: u(Ma) = 0) are equivalent. Hence we 
can form (zx). 

(ii) Each one of the following formulae holds, except for x-sets of u-measure 0: 


(zx) > 0, 


dy, d, 
a,” = =1, 


(ea) = (x) 
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) 
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(iii) Soe (x) is uniquely characterized, up to an x-set of u-measure 0, by this property 


(f(x) is any T-measurable function, M «1, everything in the same sense as in (i) 
in Lemma 3.2.3). 

Proor: We prove these statements in a somewhat changed order. 

Ad (i): = 0 implies = 0, and 4(Ma) = O implies = 0, 
u(M) = 0. Hence p(M) = 0 is equivalent to u(Ma) = 0, that is to ua(M) = 0. 

Ad (ii), first formula: Immediate by (i) above and (iii) in Lemma 3.2.3. 


Ad (iii): According to Definition 3.2.7, dg (x) is characterized, up to an z-set of 


u-measure 0, by 


(59) = @ aye 


(f(x) l-measurable, M 1). Now obviously / = 
So (59) becomes 


(60) dy = ste) (0) dye 


Replacement of M, f(z) by Ma", f(za) in (60) gives the desired formula. 
Ad (ii), second formula: Coincides with (i) in the Corollary after Theorem VII. 


Ad (iii), third formula: Two applications of (iii) above show, that ore (xa) = (z) 


satisfies the characterization of fee (x) by (iii) above. 


Derinition 3.3.4. An S,T, u-group is “free” if it has this property: 

If aeGis # 1 then xa = x holds only for an x-set of u-measure 0. 

Dertnition 3.3.5. An S,T, u-group G is “ergodic” if it has this property: 

If M «TY is such that for every a €G the difference of the sets M, Ma (i.e. the set 
(M + Ma) — (M-Ma)) has u-measure 0, then either u(M) = 0 or pn(S — M) = 0.— 

Cf. the comment in (1), p. 195, bottom of the page, concerning this notion of 
ergodicity. 

Lemma 3.3.2. An S,T, u-group S is ergodic if and only if it has the following 
property: 

If f(x) is a T-measurable function such that for every a eS f(xa) = f(x) (except 
for x-sets ofu-measure 0), then f(x) = a (constant!, except for an x-set of u-measure 0). 

Proor: Sufficiency: If f(x) runs only over characteristic functions 


fiz) = { 1 M 


= 0 otherwise |’ 
then this coincides with Definition 3.3.5. 
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Necessity: Since we may consider Rf(x), 3f(x) instead of f(x) we may assume 
that f(x) is real. And since we may consider aes instead of f(z), we may 


even assume that 
(61) < f(z) < 1. 


For any real a the set M, = (x; f(x) < a) has clearly the property used in 
Definition 3.3.5; hence either u(M,) = 0 or n(S — M,) = 0. Let 2 be the set 
of all a with = 0. By (61) —le2,1¢2, and asforasb 
so in this case b ¢ 2 implies ae Z. Hence = possesses a least upper bound, say 


1 
ay and a €2, ay + = ¢2forn = 1,2,---. 


Hence the sets (x; f(z) < ao) = and f(t) > aw) = 
(§ — Ma,41/) have w-measure 0, and so has their sum, the set (x; f(x) # a). 
That is: f(x) = a , except for an z-set of u-measure 0.— 

Our present notion of a free, ergodic, S, T, u-group corresponds to what was 
called an ergodic m-group (for S, ) in (1), page 195, Definition 12.1.5, but there 
the stronger requirement u(M) = u(Ma) replaced our present requirement that 
u(M) = imply = 0. 

This situation motivates the following definition: 

Derinition 3.3.6. An S,T, u-group G is “measurable” if there exists a Y- 
measure v with the following properties: 

(i) u(M) = 0 is equivalent to v(M) = 0. 
(ii) Always »(M) = »(Ma).— 

We conclude this section by proving two properties of the above notions. 

LemMa 3.3.3. The properties of the T-measure v given in Definition 3.3.6, are 
equivalent to the following properties of its «(z) = : (2) (remember Theorem VII), 


always except for x-sets of u-measure 0: 


(i) x(x) > 0. 
k(x) _ dy, 
(ii) = (x). 


Proor: (i) is equivalent to (i) in Definition 3.3.6: Coincides with (iii) in 
Lemma 3.2.3. 


(ii) is equivalent to (ii) in Definition 3.3.6: i (x) = x(x), hence clearly os (x) = 
Ka 


x(ta). So (ii) in the Corollary to Theorem VII gives (by evaluating os (x) in 


two ways): 
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Now (ii) in Definition 3.3.6 means vy = vq, that is, by (i) in the Corollary to 


Theorem VII > (x) = 1. Hence (62) becomes the desired equation ae es 


(z). 


3.3.4. Let be an ergodic S, u-group. Then the V-measure v of 
Definition 3.3.6 is uniquely determined, up to a constant factor a > 0. 


Proor: Let »’, v’’ be twosuchT’-measures. Form their x’(x) = (x), = 


7 in accordance with Lemma 3.3.3. Then (ii) in Lemma 3.3.3 gives 


(ea )= = Put f(x) = 

x’(x)/«’"(a) and replace « by a’, then we have: f(z) = f(za). All this, of course, 
holds with the exception of z-sets of u-measure 0. Now Lemma 3.3.2 gives 
f(x) = a, x'(x) = ax’’(x)(in the same sense) with a constant a. Hence v’(M) = 
av’’(M), and clearly a > 0. 


3.4. Derinition 3.4.1. (i) Let ©9 be the set of all (complex-valued) functions 
f(a) (a €§) with a finite Lag | f(a) |. (Remember that § is finite or enumer- 


ably infinite!) Define 


(f, 9)3 = fla)g(a). 
(ii) Let 98 = 9 | be the set of all (complex-valued) functions F(x, «) (x €S, 
a €$) which are Pintewelles in x for every a €G and with a finite 


Deeg [| FG, a) 


Consider F, H as identical if F(x, a) = K(x, a) except for an x-set of u-measure 
0 for everyaeS. Define 


= Fle, \HG, dys. 


= (F, 


SU 


Lemma 3.4.1. (i) The spaces $8, $8 fulfill the postulates A, B, C, E of (4), 
pp. 64-66. (This includes the existence and finiteness of the sums and integrals 
which define (f, 9), (F, H)8.) Thus $8, 8 are finite dimensional Euclidean or 


Hilbert spaces. 
(ii) Using the complete, normalized, orthogonal set of all functions 


=1 for a= a 
0 otherwise |’ ane 8, 
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in 99 and setting up the correspondence 
F(a, a) ~ < F(a, a), F(x, > 
(a1, an enumeration of G, so that the above ao runs over , 9° is 
isomorphic with ©2 ® Hs in the sense of (1), p. 135, Definition 2.4.1 and Lemma 
" what follows we use the more symmetric notation 
F(z,a) ~ < F(a, a); aeG>. 


Proor: We prove these statements in a somewhat changed order. 


Ad (i): Character of 99: Put f(an) = f&,n = 1, 2,--- (G1, a2, --- an enumer- 
ation of 9!), and f(a) ~ {&, &, +--+} then our definitions coincide with the 
original definition of a finite dimensional Euclidean or a Hilbert space ((1), 


p. 69, lower part of the page). 

Ad (ii): Immediate, by comparing our Definition 3.4.1 with (1), p. 135, 
Definition 2.4.1 and Lemma 2.4.1. 

Ad (i): Character of 9°: Considering what we showed about 99 above, and 
about $s in Lemma 3.2.2, this follows from (ii) 4h since 99 i is isomorphic to 
® Gs .— 

We are now in a position to begin a systematic investigation of certain oper- 
ators in Hs and 98. 


3.5. Throughout what follows, § will be a fixed S,I’, u-group. We assume 
from now on throughout §§3.5-3.7, that G is free and ergodic, but not that it is 
measurable. 

We begin by constructions in Qs . 

Lema 3.5.1. Define the operators: 


(i) Usfla) = i+ @) | flea) for any a «8. 


(ii) Lea f(x) = g(x)f(x) for any (complex-valued) l-measurable, bounded function 
These operators are bounded, U _ is even unitary. 

Proor: U. is bounded and unitary: By (iii) in Lemma 3.3.1 


J vaste) = f | = ff |e) 
So f « Ss implies Gs and || Uaf lls = lls- 


ee is bounded: We have | g(x) | S C for all z eS and a suitable C > 0. 
ence 


[ | Low f(x) = [ | | f(a) < C’ [ | f(x) |? 
So Ss implies Lyra) f € Gs, and || Lyre f lls C Ils. 
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Lemma 3.5.2. (i) U; = 1 
(i) = = Us. 
| (iii) Uses = 
‘ Proor: We prove these statements in a somewhat changed order: 
i Ad (i), (iii): Immediate by (ii) in Lemma 3.3.1. 
fi Ad (ii): (i), (iii) above give Ue-1 = Uz". And since U, is unitary, so U;' = U* 
Hi Lema 3.5.3. Let & be the set of all operators Lz) from (ii) in Lemma 3.5.1, 
Then & is a ring, and & = &. 
i Proor: As £’ is necessarily a ring it suffices to prove = L’. As LyyayLye) = 
4 


80 Loca) commutes with every Ly. AS Lye = Lye, 80 Lye) also 


| commutes with every Lj;2). Thus Ly ¢ £’, that is, LCL’. So we must only 
prove £’C &. 
Consider therefore an A Then A commutes with every Ly 2) , that is 
ALyawf = Ly Af. That is 
1 
(63) A(o(x)f(x)) = o(x)Af(z). 
The assumptions are: g(x), f(z) T'-measurable, g(x) bounded, | f(a) ( 
8 
a4 finite. (63), as well as all subsequent relations in this proof, holds with the ( 
| p exception of an x-set of u-measure 0. 
d Consider the 7,7, ... of (iii) in Definition 3.2.4; we can replace them by 8 
TS, TS, where T§? = — (T® +... + T*”), and then we have also 
that i j implies 75”, = ©. Define 
; fe 
=1 for reT 
e;(x) = (zx) 0 otherwise d 
Then [ | ex(x) |’ = is finite, and so we can put f(x) = in (63). 
8 
Bei We obtain: 
1 
(64) = (Aes(x))¢(z). 
As e,(z) is bounded, we can also put g(x) = e,(r) in (64), and so we obtain 
= (Aes(x))e.(x) that is: 
(65) Ae(z) =0 for 
Now define p(x) = Ae,(x) for ze 7§”, i = 1,2,---. Then (65) means ( 
(66) Aes(t) = p(z)-ed(2), 
and consequently (64) becomes: 
Summing (67) over i = 1, ---, n, and writing f(x) for g(x) e:(x), we obtain 
(68) Af(z) = 
if f(x) is bounded and = 0 for ¢ + + T§”, for any n = 1,2, 


| 

il 
| 
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A is bounded; so || Af ||s S C-||f||s for a suitable C > 0. Put M, = (2; 
2 +7”), n =1,2,---. We may use 


=1 for 
f(z) = 0 otherwise 


‘ in (68). Then we have: 
ly 2 [C+ = C+ 
So || A@w, Ox, gives «(M,) = 0. Consequently for M = M, = 
x (x; | p(x) | = C +1) also u(M) = 0. Hence we have (with the usual exception) 
le (69) le(z)|SC+1. 
y So p(x) is bounded, and we may form L,,2). And (68) becomes: 


for all f(z) of (68). But these f(z) form an everywhere dense set in Hs. In- . 
deed, if any f ¢ Ss is given, then define | 


| fl) for |f(z)| <n, + + 7” 


Then all f,(z), n = 1, 2, ---, satisfy the requirements stated after (68), and 
clearly limps || fn — f = 0. 

n As A, L,,2 are both bounded, this means that (70) extends by continuity to all 
feDs. Hence A = Lj) €&. So we have established £’ C £& too, thus com- 
pleting the proof. 

Lema 3.5.4. Loz) = Lye) Ua holds if and only if either = 1, o(x) = ¥(z) 
(except for an x-set of u-measure 0), or a ~ 1, o(xz) = (x) = 0 (except for an x-set 
of u-measure 0). 

Proor: Sufficiency: Obvious. 

N ecessity : Assume Lgtz) = LyayU a: That is 


(1 ola) fla) = ki | see), 


whenever / | f(z) |’d,x is finite, (71), as well as all subsequent relations in this 
8 


proof, holds with the exception of an 2-set of u-measure 0. Ae 


| 
| 
— 
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Consider the T”, 7, .-- of (iii) in Definition 3.2.4... We may put 


=1 for zeT? +...4+ 7” 
= 0 otherwise 


f(x) = 


in (71), since u(T + --- +7) is finite. Then (71) implies 


(72) ola) = v(a) 


for those x for which z, rae T° +... +7. And summing over n = 1, 2, ... 
gives (72) for all z. 

(72) settles the case a = 1, owing to (ii) in Lemma 3.3.1. Assume therefore 
a £1. Substituting from (72) into (71) gives g(x) f(x) = ¢(x)f(xa), that is 


(73) Piste) flea) = 0. 


Now consider the system A of Definition 3.2.3. We have A = (Mi, Mp, ---) 
by (iii) eod. We may put 


(i) 
1 for zeT 


= 0 otherwise 
i,j = 1,2, ---, since n(7-M,) < u(T™) is finite. Now clearly 


= 0 if neither or both 2, rae T.-M; 
= 1 otherwise 


|f@) 
and the latter 2-set is 
Ky = ((7°-M) + (T°-M)o*) 
Hence (73) becomes 


(74) \e@ = 0. 


Consider now K = )o7j1 Ki;. If 2 = xa, then never x K;;, and hence 
not eK. Ifz # za then we know from (ii) in Definition 3.2.3 that for some 
j = 1, 2,--- one, but not the other of z, za is e M;. The one ¢ M; is also 
e 7 for some i = 1, 2, --- , hence one, but not the other of.2, za is ¢ 7 -M;. 
This means x e K;;._ Consequently x eK. So we see 


(75) K = ¥ za). 


Since G is free, and as a ¥ 1, so we have by Definition 3.3.4, x ¥ za, except for 
an z-set of u-measure 0. That is 


(76) u(S — K) =0. 


| 
| 
| 
—— ‘ 
pl 
Qa 
i) 
{ 
(c 
(i 
( 
a 
+ 
| 


ON RINGS OF OPERATORS, III 139 


Summing over i,j = 1, 2, +++ converts (74) into [ | 0, and 80, 
by 76) Pde = 0. That is 


(77) e(z) = 0. 


So we have for a # 1 by (72), (77), o(x) = ¥(x) = 0, and the proof is com- 

leted. 
ieee 0 Let & be as in Lemma 3.5.3, and let U be the set of all U., 
aeS. Then £-W = (al) (a any complex number). 

Proor: al belongs obviously to WU’, and as al = Lae&, so ale &-WU, 
e.WD (al). So we need only to prove £-WU’ C (al). 

Assume therefore A Then A so for everyaeS = 
= A. Also ACL A = Low, U,AUs = Lge), hence Leea) = 
Ly. By Lemma 3.5.4 this means g(ta) = g(x), except for an z-set of 
u-measure 0. Since G is ergodic, this implies by Lemma 3.3.2 that g(x) = a 
(constant!) except for an z-set of u-measure 0. Hence A = L, = ai. Thus 
£.U C (al), completing the proof. 


3.6. We now pass to constructions in 99. 
Lemma 3.6.1. Define the 


(i) Ua F(x, a) = “° (x) 
(i) V.,F(z, a) = F(a, ao 

(iii) WF(x, a) = (x) | Flra a’). 


(iv) a) = = oe) Fe, a) for any bounded and T-measurable 

(v) MeaFla, «) = F(x, a)) which is defined for all x¢ 8. 
These operators are bounded, Ua, Vay, W are even unitary. 
Proor: U.,, V«,, W are bounded and unitary: By (iii) in Lemma 3.3.1 


for any a € 


[\0..50, «) Paye = p> (a) dye 


ae 


and 


z «) Page [ \ (a) dye 


| 
il 
| 
| 


At 
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And obviously 
ae 


So Fe implies U.,F, Va,F, WFe H%, and || = || = 
|| = || FIP. 


Liz), are bounded: We have | | C for all ze S and a suitable 
C > 0. Hence 


aeG 


aeG 
and 


se [ | F(a, 


So Fe implies Lo F, Meco) F $8, and || Loy || Moco F < C || FIP. 
3.6.2. (i) = I. (i)’ VY, = I. 

Gi) = = = = Vi,. 

(iii) = (iii)”’ = V assy 

(iv) W = W = W. 

(v) WU.,W = 

(vi) = Myz. 
Proor: We prove these statements in a somewhat changed order: 
Ad (iv): We aes? W’. One verifies immediately: 


W F(x, x) = (x) (va F(a, By (ii) in Lemma 3.3.1 (with a”, 


a in place of a, 8) (2) = (2) = 1, so a) = Fz, a), 


that is: W? = I. 
Hence W = Wand since W is unitary, so W = W*. : 
Ad (v): Owing to (iv) we may as well prove WV,,W = U.,. Now 


a) = ‘(x ) eee , aco). By (iii) in Lemma 3.3.1 


(with a’, aap in place of a, 8) 


Ww 
§ 
1 


S 


| 
| 
| 
| | 
i) 
| ( 
| 
| t! 
| 
| 
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tee = (2), 00 WV a) 
= @| F(xao, = F(x, a), 


d, 
that is: WV = 

Ad (vi): Owing to (iv) we may as well prove WL, 2) = M,W. This, how- 
ever, is obviously true, since a) are both = 

Ad (i)’, (iii)’: Obvious. 

Ad (ji)': (i)’, (iii)’ above give Vaz: = Vz). And since V,, is unitary, so 
Va = Vi, 

Ad (i)- (iii); Considering (iv), (v) above, they result from (i)'—(iii)’ above, re- 


spectively. 
Lemma 3.6.3. Let 9 be the set of all Ua, and all Lez) , and § the set of all Va, 


and all Mg. (Cf. Lemma 3.6.1.) Form for each bounded operator A in $9 
the decomposition from (1), p. 136, Definition 2.4.2: A ~ <Aag>, a, B 9, 
where every Aa,s is a bounded operator in Ss. (Cf. the decomposition $9 = 
9° @ Gs from Lemma 3.4.1. As described there, we replace the indices t, s = 
- by indices a, B €%, as already the complete, normalized, orthogonal set 
Ga, , «0 €9, cf. loc. cit. above, in G8 has been indexed this way.) Then we have: 
(i) A eS of and only tf Aap = Lxag-1@6-). 
(ii) J’ of and only tf Aa = 


Here x(x) must be a (complex valued) T'-measurable, bounded function of x for | 


every 
We do not determine for which systems x,(x), y «8, a bounded A satisfying (i) or 


(ii) actually exist. This, however, is true: 

(iii) The systems x(x), y €9, obtained from all A « J’ by (i), and those obtained 

from all A ¢ J’ by (ii), are identical. 

(iv) An Aye I’ and an Aze J’ which have the same x,(x), y €G, by (i) or (ii) 

respectively, obtain from each other by the involutory automorphism F = WS 

of 

Proor: Remember the definition of the Aa, ((1), p. 136, Definition 2.4.2, 
using our present notations) : A<f(x)ga(a); = <Aaaf(x); BeES>; 
that is (if we replace a’, a by a, 6 so that now z ¢ S and 8 «§ are the variables, 
andaeGisa parameter) : Af(x)ea(8) = Aaaf(x). Then the definitions of U.,, 
Ly, W give: If A ~ <Aas>, a, BG, then 


(79) WAW ~ a>, a, B eS, 
(80) LewA ~ <LywAas>, a, B eS, 


(81) AL ge) ~ a, B eS. 


= 


| 
4 
| 
3 
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We now proceed to prove the statements (i)-(iv) in a somewhat changed 


order: 
Ad (i): AeJ means that A commutes with all Love) 
Since = Lew , U2, = it suffices to consider the Lyi), Ugiz). That is 


(83) = A 


(82) means owing to (80), (81), that all Aas «¢&’, that is, by Lemma 3.5.3, 
Aage&. So 


Wa,g(x) a (complex-valued) I'-measurable, bounded function of z. 
(83) means, owing to (78), that 


(85) Aap = a, - 
Substituting (84) into (85) transforms it into Ly, = paz that is 
(86) Wag(X) = Waas! par 


except for an z-set of u-measure 0. 
If we put ap = 8B, and write x,(x) for w,,:(x), then (86) specializes to 


(87) Waa(t) = Xas-1(28 


except for an x-set of u-measure 0. 
Again (87) obviously implies (86). 
So A eI’ is equivalent to (84), (87), that is, to 


(88) Aas Ly (28-1) 


This completes the proof of (i). 

Ad (ii), (iv): Since W is unitary, 2 WF is an automorphism of 9%, since 

W = W"”, W’ =I (by (iv) in Lemma 3.6.2), this automorphism is involutory. 
This automorphism carries an operator A in 9° into the operator WAW. 
Hence it carries Lez) into Mera) (by (iv)—(vi) in Lemma 3.6.2), and 
so Jinto J. Hence it carries J’ into J’. 
_So the general A: J’ obtains by forming A, = WA,W for the general 
A,eJ. Or (by (iv) in Lemma 3.6.2) equivalently by forming WA,W. Now 
(88) gives the Aas of the general A; ¢ J’, hence (79) permits us to find the Aas 
of the general Az, e J’. We obtain in this way for the latter Aa g : 


(89) Aes = Us 


| 
} 
4 
( 
| 
| 
| i 
i; 


ON RINGS OF OPERATORS, III 143 


But since Ue = = Uphy,-i,(2) , (89) can also be written 
(90) Aap = 


This completes the proof of (ii). 
And we have also established all properties of = W£ claimed by (iv). 
Ad (iii): Immediate by (iv). 
Remark: We have also seen (in the proof of (ii), (iv) above), that f= WF 
carries J into J. And since it is involutory (cf. (iv) above), it also carries 


Jinto J. 
DerFINITION 3.6.1. (i) We denote the set of all systems x,(x), y € 9, which 


occur in (i) or in (ii) in Lemma 3.6.3 (both are the same, cf. (iii) eod.) by Xo. 
(ii) We denote the relation between an Ae SJ’ or an Ac J’ and a system x,(z), 
7 €9, from Xo , as described in (i) or in oe) in Lemma 3.6.3, respectively, in both 


cases by 
A= [xo(z); 7 € 9}. 


Lemma 3.6.4. = J, = J. 
Proor: Since the involutory automorphism 2 WF of 98 interchanges J 


and g (ef. (iv) in Lemma 3.6.3 and the remark after this lemma), and conse- 


quently also R(J), J and R(J), J’, it suffices to prove R(J) = J. 
Proof of R(9) € J: Oné verifies immediately, for A = Ua, 


p(x) that 


= fora = B 
Acs 0” otherwise |” So we have Aag = Ly,-19(2)Us-14, 


g(x) fory = 1 
{= 0 otherwise |’ 


respectively. So A ¢ J’ by (ii) in Lemma 3.6.3. 
We have thus proved JC J’. Since J’ is a ring, this implies R(J) C J’. 
Proof of R(J) D J’: Consider any A « I’, Be J’, then we have by Lemma 
3.6.3, Aas Ly 5 Bas = Put = AB, D = BA, 
then (1), pp. 136-137, (iv) in Lemma 2.4.3, gives: 


= Ay,pBa,, Ly. -1(28-1) Log-14(2) U,-14 


= lfory=a 
= 0 otherwise 


where { 


ve 


71«G 


| 
4 
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(we replaced the summation index y by ay), and 


1€G 


(we replaced the summation index 7 by By). Remembering that the order of 
summation does not matter in the above >),. g (cf. loc. cit. above), we see from 


the above equation that Cas = Da, for all a, Be. That is:C = D, or in 
other words: AB = BA. Since J is a ring, and A eS’ arbitrary, this means 
that Be J’. And as Be J’ was arbitrary, so we have J’ C J”. 

As Ie I(I = U;), so = JI” by (5), p. 397, Theorem 7. Hence we have 
J CRI). 

The proof is thereby completed. 

Lemma 3.6.5. Put M = R(J) = SF. ThenOW = K(J) = I, and M, MW 
are factors (in the sense of (1), p. 1388, Definition 3.1.2). 

Proor: Clearly Dl’ = R(J)’ = JI, this and Lemma 3.6.4 prove all our equa- 
tions. It remains for us to prove that 9, OM’ are factors, that is that 
A implies A = aJ. 

Consider an A Then A eM = J’ and A eM’ = J’, hence Lemma 
3.6.3 gives that 


(91) Aas = Ly 
and 


simultaneously. Comparing (91), (92), and using Lemma 3.5.4 gives: 

For 1, that isa ¥ B: = = 0, except for an z-set 
of u-measure 0. That is: In this case (91) gives Aas = 0. 

For Ba = 1, thatis a = B: xi(z8') = w(z), except for an x-set of u-measure 
0. Comparing herein 8 = 1 with the general Be gives x4(x6") = xi(2), 
with the same exceptions. Since G is ergodic, this gives, by Lemma 3.3.2 (with 
xi(z), 8’ in place of f(x), a): x(x) = a, except for an x-set of y-measure 0. 
Hence (91) gives Aaa = La = al. 

So we see: 


A = al fora = 8, 
| — otherwise 


that is: A = aJ. Thus the proof is completed. 


| 
| 
| 
i 
| 
a 
i 
\ 
; 
| 
| 
i 
| 
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3.7. We derive now the basic rules of algebraica! computation in the factors 
mM, OW. 

Lema 3.7.1. Both in M and in MW’ the following rules of computation hold: 
If A = 9}, B = 7 = 7 9} (all of them eM, 
or all of them « M'!), then we have: 

(i) If C = aA, then n,(x) = ax,(z). 

Gi) = A*, then = 

(iii) =A+ B, then = xy(x) + &,(z). 

(iv) = AB, then n,(t) = The converges 
“en mesure’”’ (cf. the precise description of this notion at the end of the proof), irre- 
spective of the order in which the a €G are gone through. 

Proor: Considering (iv) in Lemma 3.6.3, it suffices to consider the case 
where A, B,C «9. We prove the statements (i)-(iv) for this case in a some- 
what changed order: 

Ad (i), (iii): Immediate by (i), (iii) in (1), pp. 1386-137, Lemma 2.4.3. 

Ad (ii): This follows from (ii), loc. cit. above, by the following consideration: 
Cas = » Cas = Apa » = that 
is ny(2) = 

Ad (iv): Apply (iv), loc. cit. above. We have Cag = Ly,5-1(28-1), Cap = 
the sense of strong convergence, irrespective of the order of theye%. So we 
have = Dove Putting 6 = 1 and writing a, 7 
in place of y, a this becomes: 


(93) Ly) = 


in the sense of strong convergence, irrespective of the order of the a e%. 
Consider now an M eT with a finite u(M), and form 


= lforreM 
Ou(2) {= 0 otherwise 


Then 0% € Ss , and so (93) implies: If a: ,  , --- is an enumeration of §, then 


2 
lim | Ou — 3 = 0. 
That is 
2 
[ ny(x)Ou(x) — Xa; "(waz )Ou(x) | = 0, 
(94) lim ny(x) mg Xa, d,x = 0 


a | 
{ 

— | 
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So we have for every « > 0 and M eT with a finite u(M), owing to (94): 


That is: converges “en mesure” to 7,(x). As &, 
a, --- was an arbitrary enumeration of § this proves (iv). (Of course, if § 
is finite, the convergence considerations are unnecessary, the finite sum i 
being simply equal to 7,(z).)— 

We sum up the results obtained so far. 

TueoreM VIII. Let S be a fixed set, T a separable Borel-system in S (Defini- 
tions 3.2.1, 3.2.3), » a T'-measure in S (Definition 3.2.4) and S an S, T, u-group 
(Definitions 3.3.1-3.3.3; hence G is, in particular, finite or enumerably infinite), 
which is free and ergodic (Definitions 3.3.4, 3.3.5). Form the unitary (finite 


dimensional Euclidean or Hilbert) spaces Ss, ©°, $2 (Definitions 3.2.6, 3.4.1). 
In 9% form the operators Ua, , Va,, W, Lez); (Lemma 3.6.1). Let 9 be 
the set of all Ua, Lez and § the set of all Va, , Mew. Then we have: 


M = R(I) = JS isaring, its’ is WM = KR(J) = I. 


F <= WEF is an involutory automorphism of 99 which interchanges J, R(9), JI’, 
M and J, R(J), J’, M’. MM, MU are factors. The general element A ¢€ OM or 
A has a representation A = {x,(x); y 9} (Definition 3.6.1), for which the 
rules of algebraical computation are known (Lemma 3.7.1). 

Proor: This is merely a restatement of the essential results of various Defini- 
tions and Lemmas in §§3.2-3.7. Besides those mentioned in the text, cf. also 
Lemmas 3.6.4, 3.6.5, (iv) in Lemma 3.6.3, and finally (i), (ii) in Lemma 3.6.3.— 

The results and notions enumerated in Theorem VIII will be made use of 
throughout what follows, without being referred to explicitly. 

We conclude this section by two lemmata on certain special operators in Il 
and 9’. (All -relations in both of them are to be understood to hold with 
the exception of z-sets of u-measure 0 only.) 

Lemna 3.7.2. Assume A eM or M’, A = {x(z);y eG}. Then we have: 
(i) A is Hermitean if and only if x(x) = x,-1(ay") for all y «. This implies 

that xi(x) is real. 
(ii) If A is definite, then x4(x) = 0. 
(iii) If A is definite and x(x) = 0, then A = 0. 

Proor: Considering the isomorphism F 2 WF of MM, MN’ (ef. in particular 
(iv) in Lemma 3.6.3), it suffices to consider the case A ¢ DW. 

Ad (i): A = {x,(z); y€9}, A* = {x,-1(ey"); y A Hermitean means 
A = A*, that is = x,-1(zy"). For y = 1 this becomes xi(z) = (2); 
sO must be real. 


| 
| 
| 
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Ad (ii): Consider an f ¢ $s and define 


‘= f(x) fora =1 
F(a, a) 0 otherwise 


Then Fe 98 , and as A is definite, (AF, F)/8 = 0. Now one verifies easily that 
(AF, = [ | f(x) | dy. So we see: 


(96) [ xi(x) | f(x) |’ 20 forevery fe Ss. 
If M eT, »(M) finite, we can put 


fla) = { 
then f € Hs , and (96) becomes 


= lforreM 
= 0 otherwise 


(97) [ x(x) 2 0. 


Now x(x) is real by (i) above. Consider the 7, 7, .-- of (iii) in Defini- 
tion 3.2.4, and put M = (x; x(x) < 0)-T. Then u(M) S u(T), hence it 
is finite, so (97) holds. And as x(x) < 0 for all ze M, so u(M) = 0. Thus 
x(x) 2 0 for all x « 7 (exceptions as above!). Summing over i = 1, 2, --- 
gives xi(x) = 0, unrestrictedly. 

Ad (iii): Consider an ap € §, a9 ¥ 1, and twof,ges. Define 


= f(x) fora =1 
F(x, a) 4 = g(x) for a = a}. 
= 0 otherwise 


Then Fe 98 and, as A is definite, (AF, F)8 =0. Now one verifies easily, con- 
sidering x:(r) = 0 that 


(98) = dy + dy (2). 


Now by (i), xa,(2) = xXez1(za0"), so the second term in (98) is the complex 
conjugate of the first one. Hence (98) becomes 


(99) (AF, AS = 


By (99), (AF, F)8 = 0 states that 


(100) [ dyx = 0. 


| 
| | | 
| 
’ 
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If M «eI, u(M) finite, we can put 


= 1forreM 
f(z) 0 otherwise 


? 


= —sgn for eM” 
g(z) 0 otherwise 


then f, g € Hs , and (100) becomes 


(101) [| | dye 

Thus xa,(z) = 0 for all ze M (exceptions as above!). Now with the 7”, 
T®, ..+ of (iii) in Definition 3.2.4 we may put M = 7 and sum over i = 
2, 5 this gives xa,(x) = 0 unrestrictedly. 


_ So we hae Xe(t) = Ofora #1. Fora = 1it holds by assumption. Thus 
A = 0.— 

The statements which follow will be made for 9 only, but the automorphism 
Fa Win 98 (cf. in particular (iv) in Lemma 3.6.3) extends them immediately 
to IN’ too 

DeFIniTION 3.7.1. (i) We denote the set of all Lez) by &. 


(ii) For every M eT put E(M) = , where {= 


Lemma 3.7.3. (i) is an Abelian ring we have 
L= M-L. 
(ii) The projections in £ coincide with the E(M ). 

Proor: Ad (i): It suffices to prove £= OM- ‘©’: Since M- Lis is clearly a ring, 
it proves that © is also a ring, and implies ec OM, and also ec £&, that is, 
that £ is Abelian. 

Proof of CM-L’: La eI C = M, so LCM. Every com- 
mutes with every , and Li, = Ly , hence it is so Hence 
M-L. 

Proof of > M.-L’: Consider an A Since A eM, s0 A {x,(z); 
Since A £’, so A commutes with every Lye). Now Lye) ~ {o(x)5y; 
where 


lfory = 1 
= 0 otherwise | 


for oxo 
. So always | sgna <1 and 


2 For any complex a sgn a | | | 
=0 for a=0 


asgna = |al}. 
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So Led = € 9}, AL ge = € 9}. That is 


Put 


= {= 


= 0 otherwise |’ 
for an M eT, then (102) becomes 
(103) x(t) = 0 for or 


Now let M run over all elements of the finite or enumerably infinite system A 
in Definition 3.2.3; owing to (ii) eod. (103) becomes, if we sum over all M ¢€ A: 


(104) Xr(xz) = 0 
if zy ¥ x that is, if x ~ vy. Now if y ¥ 1, then since § is free, this means 
that (104) holds without any further So x,(r) = (6, as 
above), and hence A = Ly,;2) (see above), and therefore Ae&. 
Thus we have shown that 9N-L’ C L, and thereby completed the proof. 
Ad (ii): That an A ¢ & is a projection, means this: A = Ly) and A = A*, 
A =A’. But A* A’ = Lye): , so it means g(x) = = o(z)’. 
That is: g(x) = 0,1. But these g(x) are precisely the 


{= lforzeM 


= 0 otherwise 
for all M € I, and so the A are the L,,.2) = E(M) for all M eT 


CuapTeR IV. DETERMINATION OF THE CLASSES OF 9, ON’ 


4.1. The essential distinction which has to be made when determining the 
classes of MN, MU, is whether the S, I, u-group % is measurable or not. (Cf. 
Definition 3.3.6.) 

Consider first the case when $ is measurable. This means that a I'-measure 
v as described in Definition 3.3.6 exists, or, equivalently, that a function «(zx) 
as described in Lemma 3.3.3 exists. 

In this case it is possible to replace u by v altogether: § is an S, I’, »-group 
as well as an S, I’, u-group, and its being free and ergodic depends only on the 


sets of measure 0—which are the same for u and forv. s,r,, and 9%. 2 become 
Ds,r,, and 92. if we replace f(x) and F(x, a) by (x(x))* f(x) and («(x))*F(x, «) 
respectively. That is: This replacement compensates for the changes brought 
about by substituting v for » in s and 98. And finally (ii) in Lemma 3.3.3 


shows that this replacement has also the right effect in the definitions of U b's 
W, Lote) ; Mow . 


1 forzeM | 
| f 
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In other words: If § is measurable at all, then we may even assume that 1 
itself has the properties required for v, especially (ii) in Definition 3.3.6. That is: 


(105) = u(Ma) = = 1. 


This means, however, that for a measurable § our DN, MI’ coincide essentially 
with those of (1), pp. 192-204, and so the exhaustive discussion given loc. cit. 
disposes of our ON, M1 also in this case. Indeed, it is for the sake of the not 
measurable %, to be discussed in §§4.3-4.4 below, that this work was under- 
taken. (They correspond to case (III,,), cf. Theorem IX.) 

We shall nevertheless, for the sake of completeness, discuss the measurable % 
also, and that without bothering about replacing » by v. We shall, however, 
keep this discussion as short as possible, and not enter into questions which 
could also be answered now: Find criteria for normedness, expressions for 
Toy (A), [[A]], KA, ete. 


4.2. We assume throughout this section that § is measurable, and make use 
of the (essentially unique) v, x(x) of Definition 3.3.6 and Lemma 3.3.3. 
All our results, throughout §§4.2-4.4, will be stated and proved for 9M only: 


The automorphism f= WF of 98 (cf. in particular (iv) in Lemma 3.6.6) extends 


them immediately to SI’ too. 5 
DerinitIon 4.2.1. If A ¢€ MM is definite, then we have for A =~ {x,(x); y €9} 
(by (ii) in Lemma 3.7.2) x(x) 2 0. Hence we can certainly form the integral 


[ (ee) dr. 


Its sg value is = 0, but it may be finite or infinite. We denote this value 

by T'(A). 
Lemma 4.2.1. We have for A, B, C eM, A, B definite: 

(i) 05 7T°(A) +e. 

(ii) T°(A) > Oif A ¥ O. 

(iii) T°(A + B) = T°(A) + T°(B). 

(iv) C = {x,(x); 7 eS}. C*C, CC* are both eM and definite, and 


= T'(CC*) = dye. 


(Since all terms in this ye <Q are 2 0, tt ts clear what it means.) 
Proor: Ad (i): Immediate by (ii) in Lemma 3.7.2. 

Ad (ii): Immediate by (ii), (iii) in Lemma 3.7.2 

Ad (iii): Obvious 


| 
{ 
i 
ie 
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Ad (iv): O*€, CC* € MM along with ¢ MM, their definiteness is obvious. We have 
C = € 9}, C* = €G}, hence CC* = {p,(x);y € 9} with 
p(x) Hence for y = 1 


(106) p(x) = | xa(x) 


These Doae g converge “en mesure” in the sense of (iv) in Lemma 3.7.1. But in 
the oa. g of (106) all terms are 2 0, hence it converges numerically (to finite or 


infinite values). Hence both limits agree, and (106) holds in the sense of 
numerical convergence. 


Now = [ hence (106) gives: 
(107) T*(CC*) = [ | xa(x) |? x(x) 


We replace C by C*, that is x(x) by xa-1(za"). Then (107) becomes, if we 
also replace the summation index a by a’: 


(108) T(C*C) = [ | x(x) dy 


Now by (iii) in Lemma 3.3.1 and (ii) in Lemma 3.3.3 we have always [ flea) 
8 


a) de [ f(x)«(z) dye, hence the right sides of (107) and (108) are equal. 


Therefore they combine to 


(109) T(C*C) = T°(CC*) = | | x0(x) dz, 
g 8 


which completes our proof. 2 
Lemma 4.2.2. (i) If E runs over all projections « MN, then T°(E) is always 
defined, and it is a relative dimension function in the sense of (1), p. 165, Definition 
8.2.1. Wenow assign to the relative dimension function Day (E) this normalization: 
= T*(E). 


(ii) We have for the E(M) of (ii) in Definition 3.7.1., M «T: 


= f(a) = 


(ili) There exists an M €T with v(M) ¥ 0, ~. 


4 
+3 
i ul 
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Proor: We prove these statements in a somewhat changed order: 
Ad (ii): We have E(M) = Loy 2) , where 


{ lforze M 


= 0 otherwise | 
Now this means (cf. also the proof of (i) in Lemma 3.7.3) E(M) = Lp, = 
{Ou(x)6, ; y where 


lfory =1 
7 | = 0 otherwise |" 


So by Definition 4.2.1. 
(110) 1(E(M)) = [ = [ een = 


As soon as we shall have proven (i) and thereby established Doy(E(M)) = 
T° (E(M)), (110) will go over into the formula we want to prove. 

Ad (iii): Consider the T®, T®, -- - of (iii) in Definition 3.2.4. If all »(7) = 
0, then summation over all 7 = 1, 2, --- would give »(S) = 0, hence equiva- 
lently u(S) = 0, which is not the case. Hence there exists anz = 1, 2, --- with 
»(T) #0. Since we have also »(7”) # «, this completes the proof. 

Ad (i): Since every projection is definite, therefore for every projection 
E «M T(E) is defined (Definition 4.2.1) and = 0, S$ + © ((i) in Lemma 4.2.1). 
And T°(E) # 0, ~ does occur: Put FE = E(M) for an M from (iii) above, and 
use the already established part of (ii) above, (110): T°(E(M)) = »(M) + 0, . 

Therefore the character of T°(Z) as a relative dimension function can be 
established by (1), p. 170, Lemma 8.3.5: It suffices to show that T°(Z) fulfills 
the conditions (ii), (iii) in (1), p. 165, Definition 8.2.1. 

Proof of (ii) eod.: We must show that E ~ F(... ON) implies 7°(Z) = 
T°(F). By (4), p. 151, Definition 6.1.1, and p. 141, Lemma 4.3.1, it implies 
E = U*U, F = UU*. Hence T°(£) = T°(F) by (iv) in Lemma 4.2.1. 

_ Proof of (iii) eod.: We must show T°(E + F) = T°(£) + T°(F) for orthogonal 
E, F. Now this is immediate by (iii) in Lemma 4.2.1. 

This completes the proof. 

Coro.uary: SM belongs to the classes (I) or (II). (G is assumed to be meas- 
urable!) 

Proor: (ii), (iii) above together show that a projection M with Day (EL) 
~ 0, ~ exists. So 9 is not of class (III).— 

A precise determination of the class to which Sil belongs, when $ is measurable, 
could now be carried out. Owing to the above Corollary the possibilities are 
these: (In) (n = 1, 2, ---), (I,), (Il), (II,,). But since this discussion would 
coincide in all details with that one of (1), pp. 204-208, we omit it. The reader 
is referred to that work, and especially to the complete enumeration of the 
results in (1), p. 208, Theorem XII. (The réle of u there is played by our ».) 
That theorem shows in particular that all the cases enumerated above do actually 
arise when S, %, u are chosen suitably. 
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| 
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4.3. Later in this section we shall make the assumption that 9 is not meas- 


urable. 
Without knowing what the class of 9 is, we shall nevertheless use its relative 


dimension function Doy(#). (Cf. (1), p. 165, Definition 8.2.1, and p. 168, 
Theorem VII—it does not matter now in which normalization.) We shall see 
ultimately, in Lemma 4.3.5, that the class is (III,,), and hence the range of 
Do (EB) consists of 0, «& only—but this need not concern us now. 
We begin by investigating the Day(H#(M)), M « I (ef. (ii) in Definition 3.7.1). 
Lemma 4.3.1. (i) If Mi, Me, --- andi j implies = 9, then 


Doy(E(M, + M2 + ---)) = Dog (E(Mi)) + Don(E(M2)) 
(ii) If M eI, then for all aeG 
(E(M)) = Doy(E(Ma)). 


Proor: Ad (i): One concludes immediately from (ii) in Definition 3.7.1 that 
E(M) is the projection of the following closed, linear set I2(M): 


(111) E(M) = Pray, = (F;F(a,a) =0 if 


Now it is clear that under the above assumptions on M,, M2, --- Dt(M; + 
M. + ---) = [Mt(Mi), Mt(Me), ---], and the Mt(M:), --- are mutually 
orthogonal. Hence (1), p. 168, Lemma 8.3.2. gives Doy(E(Mi + Me+ ---)) = 
Dg (E(Mi)) + Doy(E(Mz)) + - -- as desired. 

Ad (ii): Ua (ef. (i) in Lemma 3.6.1) is unitary and e JC R(J) = Mi, and it is 
clear from (111) that VU. maps I(M) on M(Ma). Hence U,E(M) is partially 
isometric and has the initial set @(M) and the final set Mt(Ma). So M(M) ~ 
M(Ma) (.-- Ml) andhence Doy(E(M)) = (E(Ma)) by (1), p. 151, Definition 
6.1.1., and p. 165, (ii) in Definition 8.2.1. 

Lemma 4.3.2. There are only the two following alternatives: 

(a) For no M eT is Day(E(M)) 0, ~. 
(b) There exists a sequence M,, Mz, --- T,Mi + M2 + --- = S, such that all 
Dey (E(M,)) ~ 0, ~. 

Proor: Define a subsystem K of I as follows: M « K if and only if there exists 
asequence M,, Mz, --- el with M; + Mz +--- =M, suchthatall Doy(E(M,)) 
#0, 

Observe: 


(112) M°M®,...¢K imply €K. 


Indeed: Form the sequence M{”, M$” , --- for M (i = 1, 2, --- , ef. above), 
then M$”, i,j = 1,2, --- , written as a simple sequence M;, M2, --- will do for 
M® +M®4.... Next: 


(113) MeK implies MaeK forevery ae¥%. 


| 
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Indeed: If M,, Mz, --- do for M, then Mia, Moa, --- will do for Ma con- 
sidering (ii) in Lemma 4.3.1. And finally: 
(114) If MeK,MCM’, u(M’ — M)=0, then M’eK. 


Indeed: Form the sequence M,, M2, --- for M, then replace M; by M, + 
(M’ — M) and leave Mz, M;, --- unchanged: This sequence will do for M’. 
Consider now the T, T®, --- of (iii) in Definition 3.2.4 for u. Define: 


(115) a; = Least upper bound of (u(M-7); M € K). 


For i, 7 = 1, 2, --- we can choose an M;,; with 


(1i8) 


Write the M;,;, i, 7 = 1, 2, ---, as a simple sequence M, M®,.--. Then 
by (112) Mi; = M® +M® + ... €K, thatis 
(117) M K. 


t,j=1 


We have u(if-T) = u(Mi;-T®) = a; — Since this holds for all j = 1,2, 


., aad w(M-T) does not depend on j, so it implies u(M-T) = a;. By 
(115), (117) also n(M-T) S a;, so 
(118) u(M.T”) = a;. 


Now consider an M eK with M > M. Then we have M.T° D MT 
and both sets are C7”, so their u are finite. This, together with (115), (118), 
gives: 

u((M — M)-T®) = p(M.T® — M-T®) 
= n(M-T) — a; — a; = 0, 


that is: 


(119) u((M — M)T) = 0. 
Summing over 7 = 1, 2, --- and restating our hypothesis, we obtain: 
(120) If MeK and MDM, then u(M — M =0. 


(117), (118) give for every a Mae K. Hence by (112) M + Mae K (put 
M® = M,M® =M® =...=Ma). SoM = M + Ma fulfills the hypotheses 
of (120), and therefore 


(121) u((M + Ma) — M) = 0. 
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Replacing a by a” in (121) gives n((M + Ma") — M) = 0, hence u(((M + 
Ma") - M)a) = 0, that is 
(122) u((M + Ma) — Ma) = 0. 


Summing (121), (122) gives: . 
(123) + Ma) — (if. Ma)) = 0. 
Since (123) holds for all a eG, and since $ is ergodic, we must either have 
(124a) u(M) = 0 
or 
(124b) u(S — M) = 0. 
Case (124a): Consider any M eT with Doy(E(M)) ~ 0, ©. Then we see, 
by replacing the M,, M2, --- of M by M, Mi, Me, +++, that M + McK. ’ 
M + M fulfills the hypotheses of (120), hence u((M + M) — M) = 0. This | 
and (124a) give together «(M + M) = 0, hence u(M) = 0. Now (ii) in Defini- 
tion 3.7.1 shows E(M) = 0 and so Day (E(M)) = 0, which contradicts our as- 
sumption. 
So we have never Doy(E(M)) ¥ 0, ~, which is the alternative (a). 
Case (124b): (117), (124b), and (114) give together S«K, which is the 
alternative (b). 
Lemma 4.3.3. If the alternative (b) of Lemma 4.3.2 holds, then $ is measurable. 
Proor: For every M «¢ T define 


= Doy(#(M)). 


This vy is a [-measure, because it fulfills the conditions (i)—(iv) of Definition 
3.2.4. Indeed: 
Ad (i), (ii): Obvious. 
Ad (iii): Immediate by (b) in Lemma 4.3.2, if we put M,, Me, --- for T, 
Ad (iv): Coincides with (i) in Lemma 4.3.1. 

We now prove the measurability of § by showing that this y also fulfills the 
conditions (i), (ii) of Definition 3.3.6. Indeed: 

Ad (i): »(M) = 0 means Doy(E(M)) = 0, that is E(M) = 0, that is, by (ii) in 
Definition 3.7.1, L9,,(2) = 0. But this is clearly equivalent to u(M) = 0. 

Ad (ii): Coincides with (ii) in Lemma 4.3.1. 

This completes the proof.— f 

We next derive a property of the operation A'”*!*'" (ef. Definition 2.1.2). h ; 

Lemma 4.3.4. Since L OM is an Abelian ring (by (i) in Lemma 3.7.3), so every 
sequence of projections BE, , E2, --- ¢ & is commutative and hence for every A «MN | 
of finite rank A'*1'®*|-~- can be formed and is M (by Definition 2.1.2). : 

Put A = {xy(x);y = € 9}. 


q 
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Then 
xi(t) = x1 (x), except for an x-set of u-measure 0. 
Proor: All z-relations in this proof are to be understood to hold with the 
exception of z-sets of u-measure 0 only. 
We define 


(125) = {y™ (2): y € forn = 1,2,..., 
and proceed in several successive steps: ' 
Proof of = x{? (x): We have 
(126) = BAR, + I — 
Now EB, e &, so by (ii) in Lemma 3.7.3, = E(Mi) for an MieT. Thatis, 
E, = Ley, , 
bw, (x) = lforxre M, 
ms = 0 otherwise |" 
This means, as we know, EF; ~ €$}, 
PY lfory =1 
7 | = 0 otherwise |’ ( 
This and A = {x,(x); 7 € 9} give, owing to (126): 
= (ay) + (1 — Ou, (2)) (1 — Ou, € 9}. 
Hence by (125): 
(127) = (x) + (1 — Ou, — Ou, 
Put y = 1 in (127); then, since 6u(x) = 0, 1, it goes over into x{”(x) = xu(z). ( 
Proof of xi” (x) = (x) (for n = 1, 2,---): We proved x(x) = x{”(2). 
Replace A, EF, by then this becomes x{"(z) = x{"*? (z). ( 
Combining the two above results gives: | 
Proof of x1(x) = xi (x): We have (by Definition 2.1.2) 
That is: For every 99 || = 0, and hence 
for all F, H 
2 


(129) lim agg, — = 0. 
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Choose any f, g ¢ Ss , and define 
f(z) fora =1 
Fa, 0 otherwise |’ 
g(x) fora =1 
0 otherwise |’ 


then F, He SY, and 


Hence (129) becomes 
lim @) — xi f(z)g(z) d,x = 0, 


or, considering (128), 
(130 [ cae) dye = 0. 


If M eT, u(M) finite, we can put 


=1 for reM 
f(z) 0 otherwise 


sgn (x oy —xi(z)) for teM 
otherwise 


(cf. footnote”), then f, g ¢ Ss , and (130) becomes 


(131) xi(z) — | dx = 0. 
Thus = x1 (zx) for all z M (exceptions as above!). Now with the 7, 
ae - of (iii) in Definition 3.2.4 we may put M = 7°”, and sum over i = 1, 2, 
gives xi(x) = xi (zx) unrestrictedly. 
Thus the proof is completed.— 


We are now in the position to take the decisive step. 

Lemma 4.3.5. If G is not measurable, then OM is of class (III,). 

Proor: Assume that $is not measurable. We shall show that 9M is of class 
(III,,) by applying Lemma 2.2.2. For the @ mentioned there we shall choose the 
£ of (i) in Definition 3.7.1. So we must only prove the hypotheses of Lemma 
2.2.2 for @ = 


Lis an Abelian ring CON: By (i) in Lemma 3.7.3. 
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Then 
xi(t) = xi (2), except for an x-set of u-measure 0. 
Proor: All z-relations in this proof are to be understood to hold with the 


exception of z-sets of u-measure 0 only. 
We define 


(125) = (x); € 8} form = 1,2,-.-, 


and proceed in several successive steps: 
Proof of x:(x) = x{?(x): We have 


(126) A'® = + (I — — EB). 


Now BE, e &, so by (ii) in Lemma 3.7.3, E, = E(M;) for an MieT. That is, 
= Loy, (2) 


= lforre M, 
(x) = 0 otherwise |° 


This means, as we know, E; ~ {6u,(x)6, ;y € 9}, 


lfory =1 
Y | = 0 otherwise |" 


This and A = {x,(zx); vy € 9} give, owing to (126): 

= + (1 — (1 — € 9}. 
Hence by (125): 
(127) = Om, + (1 — Ou, (2) (1 — Om, 


Put y = 1 in (127); then, since 0u(x) = 0, 1, it goes over into x} (xz) = x1(z). 
Proof of xi" (x) = xi"*? (x) (for n = 1, 2, ---): We proved = x{?(z). 

Replace A, by then this becomes x{" (xz) = x{"*?(z). 
Combining the two above results gives: 


(128) x(x) = xi? (a) xi” (a) 
Proof of x1(x) = xi (x): We have (by Definition 2.1.2) 


lim str len 


That is: For every Fe 99 = 0, and hence 
for all F, KH 98 


(129) lim — = 0. 
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Choose any f, g € Ds , and define 


f(z) fora = 1 
Fa, 0 otherwise |’ 


g(x) fora =1 
{= 0 otherwise |’ 


then He 9%, and 


Hence (129) becomes 
lim Ga @) — xi (z)) f(a)g(x) d,x = 0, 


or, considering (128), 
(130 [ Gal) @)4@)9@) dye = 0. 


If M eT, u(M) finite, we can put 


=1 for reM 
f(z) 0 otherwise 


Guz) —xi(x)) for reM 


otherwise 


(cf. footnote”), then f, g € Ss , and (130) becomes 


(131) [ - | ae = 0. 


Thus mle) = xi(z) for all x eM (exceptions as above!). Now with the 7, 
rap - of (iii) i in Definition 3.2.4 we may put M = 7°”, and sum over i = 1, 2, 
this gives = xi (zx) unrestrictedly. 

Thus the proof is completed.— _ 

We are now in the position to take the decisive step. 

Lema 4.3.5. If § is not measurable, then OM is of class (III,,). 

Proor: Assume that Gis not measurable. We shall show that OM is of class 
(III ») by applying Lemma 2.2.2. For the @ mentioned there we shall choose the 
£ of (i) in Definition 3.7.1. So we must only prove the hypotheses of Lemma 
2.2.2 for@ = & 


Lis an Abelian ring CON: By (i) in Lemma 3.7.3. 


' 
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Ad (i): Thé ring MN-L’(CM) is purely infinite: By Lemmata 4.3.2, 4.3.3 
since is not measurable, the alternative (a) of Lemma 4.3.2 holds. But this 
means, considering (ii) in Lemma 3.7.3 and Definition 2.2.1, that £ is purely 
infinite. 

Now by (i) in Lemma 3.7.3, £ = Mi-L’, hence we have shown that M.-L’ is 
purely infinite. 

Ad (ii): For every finite projection P(e) F'*:'*2'" = 0 implies F = 0: 
Assume F as described. Put F ~ {xy(x); y = {x} (x); 
Then F'#:!#:!--- — Q implies xi(z) =0. (All z-relations are to be understood to 
hold with the exception of z-sets of u-measure 0 only.) By Lemma 4.3.4, 
xi(x) = xi (ax), hence x:(z) = 0. Now F being a projection, is definite. Hence 
x:(x) = 0 implies, by (iii) in Lemma 3.7.2, that F = 0. 

Thus the proof is completed.— 

We may sum up our results as follows: 


Tuzorem IX. Let S, I’, u be as in Theorem VIII, and form Ss, S° and 


OM, MU’ as described there. Then we have: 

If S is measurable (cf. Definition 3.3.6), then the factors MN, MW’ belong to the classes 
(I) or (II). The precise determination of their classes is given in (1), p. 204, 
Lemma 13.1.1, and p. 206, Lemma 13.2.1. (Cf. the end of our §4.2.) If S is not 
measurable, then the factors N,N’ belong to the class (III) (that is, (III,,)). 


Proor: It suffices to consider only: The automorphism of 


(ef. Theorem VIII) extends our results from ON to MW. 

Our statements concerning a measurable G result from the Corollary to Lemma 
4.2.2, and the discussion which follows, at the end of §4.2. Those concerning a 
non-measurable $ result from Lemma 4.3.5. 


4.4. Examples of S, I’, 1 and G as required by Theorems VIII, IX for any one 
of the classes (I)-(II)—that is, of free, ergodic, and measurable groups S$—were 
given in (1), pp. 206-208, Lemma 13.2.1, and the specific examples (a) — (7). 
So there is only one more thing left for us to do: To give examples of S, I’, » and 
§ as required by Theorems VIII, IX for the class (III)—that is: of free, ergodic, 
and non-measurable goups 9. 

A certain class of examples was mentioned in the Introduction, §4, but we 
shall not investigate it here. Our examples will be based on the following lemma: 

Lemna 4.4.1. Let S,T, and be as described in Theorem VIII. Denote the 
set of those a €G, for which always u(M) = p(Ma), by 8. Gp is a subgroup of § 
and a free 8,1, u-group along with S. It is also measurable, with u for v. (Cf. 
Definitions 3.3.3, 3.3.4, and 3.3.6.) Now we assume: 

(i) Gois ergodic. (Cf. Definition 3.3.5.) 
(ii) ¥ G. 
Then S is not measurable. 

Proor: The statements about % which precede (i), (ii) are obvious on the 

basis of the definitions referred to. 
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Assume now (i), (ii), and, per absurdum, the measurability of §. Consider 
the vy of Definition 3.3.6 for §. This v will also do for %. So u, v both do for 
G. Now {% is ergodic by (i), hence Lemma 3.3.4 applies for % in place of 9, 
and gives 4(M) = av(M) for a constant factor a > 0. Since we have for every 
ae9 v(M) = »(Ma) by the definition of v, so this gives u(M) = »(Ma), that is 
ae%. Hence % = 9%, which contradicts (ii). 

This completes the proof.— 

We can now give specific examples. 

EXAMPLE (a). Let S be the set of all real numbers, I the system of all Borel 
sets in S (in the usual sense), and » the common Lebesgue measure (for Borel 
sets). Consider the following one-to-one mappings of S on itself: 


(132) a(p,o):xeprta, p > 0, p, o rational. 


These a;(p, c) form an obviously enumerably infinite group. Let $ be this group- 
One verifies immediately that § is an S, I’, u-group, and also that it is free. 

Clearly u(Ma) = pyu(M) ‘for a = a(p, «), hence the set % of all « € with 

u(Ma) = »(M) consists of alla = a1, (p,c) withp = 1. So%& 

G is also an S, I, u-group, and ergodic by (1), p. 206, Lemma 13.2.1, and p. 
208, example (8). Hence % is a fortiori ergodic. 

So S, I’, w and & fulfill the requirements of Theorems VIII, [X, and % fulfills 
the conditions of Lemma 4.4.1. Hence $ is not measurable. 

Observe that in this case u(S) = @. 

EXxAmpLe (8): Let S be the set of all complex numbers z with | z | = 
the system of all Borel sets in S (in the usual sense). Let » be the common 
Lebesgue measure to be applied after S has been mapped by the mapping 
z = e*” on the set S, of all real numbers g = 0, < 1 which can be (and in what 
follows will be) looked at as the set of all real numbers (mod 1). 

Consider the following one-to-one mappings of S on itself: 


(133) ax(0, u) ja] =1, |u| <1. 
These a2(@, uw) form an obviously continuous group. Denote this group by 
8." Consider the following elements of $: The a = a2(6, u) with @ = e”, p 
rational, uv = 0, and that one with @ = 1, wu = 3. They form an enumerably 
infinite set, and consequently they generate an enumerably infinite subgroup of 
Let be this subgroup. 

In combining two a = a2(@, u) of (133), their 6 simply multiply. Hence the 
9 of alla = a2(6, u) eG are the products of the 6 of those a = a2(6, u) with which 
we started, that is: the @ = e”"”, prational. Therefore we have: 


(134) a = a2(0, 0) 9 if and only if 6 = ¢**, p rational. 
One verifies immediately that 9 is an S, I’, z-group, and also that it is free. 


'’ G plays a well-known and important réle in the theory of conformal mappings. 
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Let us now determine %%, a € % means that u(Ma) = u(M). Since a = a2(6, uv) 
maps arcs on arcs, this requires that it conserve their lengths, hence, that it bea 
rotation or a reflection. That is: 


or 2 |@| =1. 


(133) excludes the second alternative, and the first one means u = 0 for a = 
a2(0, u). So Gp is characterized by u = 0, and consequently by (134). So 
§ since a = a2(1, is but not &. 

Now map S by z = e””*” on the g-set S, (cf. above). Then the mappings of 
GY» go over from z @ ¢*”z (in (134)) into g = ¢ + p, p rational. This proves 
that %» is ergodic (in S,, and hence also in S), by (1), p. 206, Lemma 13.2.1, and 
p. 208, example (8). Hence $ is a fortiori ergodic. 

So S, I, u and G fulfill all requirements of Theorems VIII, [X, and % fulfills 
the conditions of Lemma 4.4.1. Hence $ is not measurable. 

Observe that in this case u(S) = 1.— 

Besides establishing the existence of factors 9 of class (III), the above 
examples can also be used to determine all existing types of factorizations Mu , 
M.. To this end we need the lemma which follows. 

Lemna 4.4.2. Consider the S, T, u, $, Go and MM, MN’ of Lemma 4.4.1. Every 
Ae W is A = {x,(x); € 9}; consider those for which = 0 whenever ¢ 
Denote their set by MN. Then we have: 

(i) Disa ring. 

(ii) OW-OV = (al) (a runs over all complex numbers). 

(iii) ON, Yl ts a factorization, but M, M1 are not coupled factors. (Cf. (1), p. 138, 
Definition 3.1.3.) 

(iv) The class of the factor 2 is (Il) or (II,,) according to whether u(S) is finite 
or not. 

Proor: Ad (i), (ii): Literally the same as the proof of (1), pp. 208-209, 
Lemma 13.4.1. 

Ad (iii): Literally the same as the proof of (1), p. 209, Lemma 13.4.2. 

Ad (iv): For A ¢ 9%, A definite, we have also A eM, A = {x,(zx); 7 € 9} (but 


= 0 for y ¢G!), and we define again T,(A) = d,x. Then the 
considerations of Lemmas 4.2.1., 4.2.2, apply literally in 91 (because for a €%b 


u(M) = u(Ma), and for all other a we have xa(z) = 0). Hence Dey (EF) = 


To(£) for all projections E «9. Hence we see, as in the Corollary after Lemma 
4.2.2, that 91 belongs to the classes (I) or (II). 

If MN were of class (1), then MU would be a normal factor by (1), p. 184, Lemma 
11.2.2, and hence Sl, 9 would be a coupled factorization by (1), p. 183, Lemma 
11.1.2. This contradicts (iii) above, hence 9 is of class (II). 


Now = To(Z) = [ 1-d,x = u(S), hence Mis of class (II;) or (1I,); 
according to whether yu(S) is finite or not.— 
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Thus Lemma 4.4.2 shows, if we avail ourselves of the above Examples (a), 
(8), that non-coupled factorizations of the classes (II,), (III,,) and (III,) 
exist. This shows that no analogue of (1), p. 182, Theorem X (which holds for 
coupled factorizations) holds for non-coupled factorizations. Indeed, the only 
restriction on non-coupled factorizations is that no factor can be of class (I) 
(by (1), pp. 183-184, Lemmas 11.1.2, 11.2.2, cf. the above proof of (iv) in Lemma 
4.4.2). We could also show the existence of non-coupled factorization of class 
(III,,), (III,,), but we shall not discuss this now. 
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be a sequence of numbers, where for all n 


Weyl’ calls such a sequence uniformly-dense in [0, 2x], if for every subinterval 
[a, 


(2) 


Weyl proved that the sequence Jt; is uniformly dense in [0, 27] if and only if 


LS 
(3) lim — = 0 
for every integer k. 
Suppose we are given on the unit circle of the complex z-plane a sequence 
of points 


ig (2) ip (2) 


with 
Osa” <Q” <... 


1H. Weyl: Uber die Gleichverteilung von Zahlen mod Eins, Math. Ann., 1916, Bd. 77, 


pp. 313-352. 
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for all n. The sequence is called uniformly dense on the unit circle, if for any 
are of the length 1 


l 
holds. It is satisfied if and only if the sequence of numbers 
Cw 
gl 


is, in the sense of (2), uniformly dense in [0, 27]. 
Let a closed Jordan-curve | of the complex ¢-plane be given. The sequence of 


points 


(n) n 


lying on I is called uniformly dense if, mapping the exterior of 1 schlicht-con- 
formally and the periphery continuously on the closed exterior of the unit 
circle of the z-plane, we obtain on the circumference of the unit circle a sequence 
of points 


Mm, =< 


( 
5 
(5) 20”, 26, 
ty | 


uniformly dense in the sense of (4). 

We have to explain the case, when / degenerates into an open are on the 
s-plane. In this case, in the same say as in (5), we obtain two z\” belonging to 
one ¢;". Let l be e.g. the interval [—1, +1]; in this case the mapping function 
being ¢ = a(¢ + 2) the connection between the ¢{” and 2”, i.e. between 
» and g,”, respectively, is 

= + = cos of” 


* The uniform-dense distribution of points lying on p, but with potential theoretic 
characterization, occurs at first in the investigations of Hilbert; see D. Hilbert: Uber die 
Entwickelung einer beliebigen analytischen Function einer Variabeln --- , Nachrichten von 
der K6niglichen Gesellschaft Gottingen, 1897, pp. 63-70. The above formalization, which 
1s equivalent to Hilbert’s, is due to L. Fejér, see L. Fejér: Interpolation und konforme 
Abbildung, Nachrichten van der Kén. Ges. Gottingen, 1918, pp. 319-331. 
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For fixed v and n, gs” has two values, which lie symmetrically with respect to 
gy = 1m; 80 we call the sequence of points 


¢{? 


(6) 


lying on (—1, +1], with 
>--. = -1, n=1,2,... 


uniformly dense, if for the sequence of numbers 


. 


) 
gs” 


. 


defined by’ 
= cosg,”,0 So)” =1,2,--- n,n =1,2,---, 


no us 


holds, where a, B] means an arbitrary subinterval of (0, 7]. 

To this definition, appearing artificial at the first moment, we may give geo- 
metric sense in the following way: Let us draw upon [—1, +1] a semicircle and 
project the point x!” upon the circle and obtain AS”. Then gs” means clearly 
the angle between the positive real axis and 0A{”. So the above definition 
means that a sequence of points in [—1, +1] is here called uniformly dense if, 
projected upon the unit circle, the projections are uniformly distributed. Accord- 
ing to this definition the most uniform distribution is the case ¢{”) = vx/(n + 1), 
(v = 1, 2,---n, nm = 1, 2,---), attained when the sequence of points in 
[—1, +1] is = cos vr/(n +1) (v = 1,2,---, n,n =1,2,---). Let us 
observe that for this sequence 


ont) = I] & = 


v=] 


where U,(¢) are the Tchebysheff-polynomials of second kind; U, (cos #) differs 
only in a factor from sin (n + 1)6/sin 0, which is independent of #; these poly- 
nomials are well known by their many important extremal properties. This 
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holds also in the important case go” = ~— . m™. In both cases exactly 


E of” (w fined), fall in any subinterval [a, of [0, «]; the bracket 
B-—a 


According to Weyl’s criterion the uniformly dense distribution of a sequence of 
points is assured by the asymptotic behaviour of certain sequences associated 
with it. Fekete‘ gives another criterion of the uniform dense distribution; he 


forms with the sequence of points 2; the sequence of polynomials 
waz) = (s — (n = 1,2, +--+) 


in the last expression means the largest integer contained in n. 


-{ 0 +1 


Fie. 1 


and shows that Qt; is uniformly distributed upon / when and only when at every 


fixed point 2 of 1 
lim (| wn(zo) |)" M, 


where M is the so-called transfinite diameter’ of I. If lis the interval [—1 +1] 
then M = 3; if lis a circle, the radius of which is 1, then M = 1. Thus Fekete’s 
criterion requires instead of asymptotic equalities only inequalities. 

Let us now consider the special case when / is the interval ie +1]. In this 


case we stated that the ideal case is obtained when |e=* 


(n fixed), fall in any subinterval [a, 8] of [0, x]. The above mentioned theorems 
give no account of the measure of the deviation from this ideal case. In this 
direction we prove the following 


n | numbers 


‘ We know this theorem only from an oral communication. 
* This notion was introduced by M. Fekete: Uber die Verteilung der Wurzeln bei gewissen 
algebraischen Gleichungen, etc., Math. Zeit., 1923, pp. 228-249. 
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THEOREM: Let the sequence of points 


(1) 


(n) 
On 


with 


i2n >: 22 


be given. Let us construct the matrix 
Cw 


=4 @ (n) ¢ 
Yl »P2 Qn 


with 


= cos 0s of” <7, p= n=1,2,-- 
If for the polynomials wn(¢) = []e-1 (¢ — £5") the inequality 
(7) | -1s¢s +1,2 = 1,2,---, 


holds, then for every subinterval [a, 8] of [0, 3] we have 


log 3 


Our proof differs thoroughly from that of Fekete and besides it is elementary. 
A(n) in (7) denotes any function of n tending monotonically to infinity and for 
which, following Tchebysheff, A(n) = 2. 

The proof requires a theorem of M. Riesz,’ the proof of which is so short that 
for sake of completeness we may reproduce it as a 

Lemma. Let the trigonometric polynomial f(g) of order n take its absolute 
maximum in [0, 2x] at ¢ = gp ; then the distance of the next root from this gp is at 
least x/2n to the right or to the left. Thus a fortiori: if f(g) takes its absolute mazi- 
mum between two real roots, then the distance between these roots is = x/n. 

Proor. Suppose that the theorem is false. Without any loss of generality 
we may assume g = 0, there is a maximum at ¢ = g and the value of this 
maximum is 1. Suppose, that the next root lies to the right. Then for ¢ = 0 


Mathematikerver, 1915, Bd. 23, pp. 354-368. 
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the curves y = f(y) and y = cos ng would have at least one point of intersection 
and according to the supposition at least three in [0, x/n]. There is evidently 
at least one point of intersection at each of the intervals 24/n], [2x/n, 34/n], 
... [(Qn — 2)x/n, (2n — 1)x/n] too; thus the trigonometric polynomial f(g) — 
cos ng, of order n, would have in [0, 2x] (2n + 1) roots, an obvious impossibility. 

Now we proceed to prove our theorem. It will be sufficient to prove the 
upper estimate, for if we have proved 


4 


ase(")<p 


for every subinterval [a, 8] between 0 and z, then 


(9a) > 1 =" < log 3 (n log A(n)) 
(9b) x 1 Sanh log 3 (n log A(n)) 
sx 


i.e. from (9a) ii (9b) 


ase ™)<p 0<¢\")<a Bse 


which implies the lower estimate. 
Let us now consider the upper estimate. Let [a, 8] be any subinterval of 


(0, x], but now we regard it as fixed. Let k = one 


integer, and consider the following extremum problem of the Tchebysheff type: 
determine the minimum of the absolute maxima of the polynomials f(¢) = 
c+ ag" + .-. + a, taken in [—1, +1] with the restriction, that f(¢) has in 
the interval [cos 8, cos a] = [a, b] , k + 21 roots, (counted by their multiplicity), 
where as a matter of fact k + 21 < n. By a well known argument the existence 
of this polynomial is assured. We shall prove that the extremum polynomial 
fi(s) takes its absolute maximum value with respect to [—1, +1] in each of its 
root-intervals [${$1, ¢£"] which is in the interior of [a,b]. For let the absolute 
value of the maximum of f,(¢) in [—1, +1] be M and, in [¢{3, o£], |A(O) | S 
M — », with » > 0. Then according to a fundamental theorem, for the poly- 
nomial 
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(where > 0), we have in [¢{3} — + | M — 4n, if suffici- 
ently small. On the other hand in the exterior of + since 


Ao) OE — — — \ 


(m) 
we have, fore < 
y+1 


) < (¢) n n 


This is less than M for sufficiently small «, which contradicts the fact that 
fi(g) is an extremum polynomial. Thus fi(cos #) is a polynomial of order n the 
k + 21 roots of which in [a, 6] determine intervals such that in each of these 
intervals f(cos #) takes its absolute maximum. 

Now we apply the theorem of M. Riesz formulated in the Lemma. According 


to this f(cos #) cannot have in the interior of [a, 6] more than k _ *n| =k 


roots i.e. 21 roots must be located at the borders and consequently their mul- 
tiplicity must be / at least at one of the borders. 

By the premise, by the definition of fi(¢), and by the — we obtain that 
if w,(¢) has k + 21 roots in [a, b], then 


10 
2" ” 


max |wn(¢)| = =. | = | |, 


where y,(¢) = ¢" + bi¢"** + --- + b, denotes the polynomial of degree n the 
absolute maximum of which is a minimum for polynomials of degree n having 
somewhere in [—1, +1] one root with multiplicity 1. As 


max le OL a) 


we have, evidently, 


where ¥2(¢) = ¢" + --- runs over the polynomials of order n having somewhere 
in [ pee r a root of the multiplicity 1. Let I,,(¢) denote the minimum value of 
La = gat if ¥s(¢) = ¢" + --- runs over the polynomials of degree n having 
at a fixed fin [—1, +1] a root of multiplicity 1. Then we have 


(11) = min (2 


zt <+1 


Qn 
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Let us now consider I,(fo). Every ¥s(¢) can be written in the form (¢ — {)' 
y(g), where = Thus 


1 2 Ui 


Let ¢ = AC a 7 which transforms [—1, +1] into the upper part of the unit 


circle of the z-plane. Then—for z = e'*— 


1 er + 1 1 


as |z| = 1, we have evidently for {> = cos ao 


min 


= min |yole) 


where the last minimum is to be taken amongst the polynomials y¢(z) = 2” 4+ 
--- of degree 2n having at z = e“*° and z = e ‘“° roots of multiplicity J. Thus 


1 
> | r(z) |’ de, 


where the minimum relates to the polynomials of degree 2n yx(z) = 2°" + --- 
having an Lfold root only at z = e*. But in this case, since ¥x(z) = 
(2 — we have 


where ¥s(z) runs over the staat a of degree (2n — 1) beginning with 2” '. 
Finally by replacing ¢ by a9 + ¢ + x we obtain 


(12) Lite) > min | yo(z) + 2|” dy, 


the minimum being taken for all polynomials ¥(z) = 2" ' + --- of degree 
(2n — 1). 

If p(y) defines in [0, 27] a non negative and L-integrable function then, after 
Szegé we may define a sequence of polynomials ¢o(z), ¢:(z), --- such that 


(13a) on(z) = +... m= 1,2, --- 


! 
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In this case ¢m(z) minimizes, for polynomials U(z) of degree m of the form 
U(z) =2"+ ---, the integral / i | U@) |’p(y) dg. For any other such poly- 


nomial may be reduced the form ¢m(z) + 4m—1(2), Where 4m—1(2) is a polynomial 
of degree (m — 1). Then by (13b) 


+ Pole) de 
| |’ p(y) dy + | dp + | dy 


= / | |’ p(y) dp + | |’ p(y) dp = | |’ de, 
jz|=1 |2|=1 


and equality holds only for 7»1(z) = 0. The expression on the right side of (12) 
takes its minimum value for the (2n — 1)" polynomial orthogonal to the weight- 
function p(y) = |1+2|*. According to a theorem of Szegé these polynomials 
may be expressed in terms of Jacobi polynomials but we prefer to present them 
in the form of an explicit integral interesting in itself. For m = 2n — I we write 


2n +1 2n + 1 


This expression is a polynomial; we prove it by showing that Fen4(—1) = 
Fony(—1) = = = 0. The first of these equations is an imme- 
diate consequence of (14). Since for 1 S v S$ 1 — 1 we have by (14) 


it is evident that the assertion holds for1 < S$ 1—1. On the other hand 
= 1)! (1 + 2)'2"", 


ie. evidently F{2..(—1) = -.- = FR'TP(-1) = 0. Hence the expression 
in (14) is a polynomial. 

We now prove that the coefficient of z2*”~’ in (14) equals 1. The coefficient in 
question is 
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which by the ee t = zw can be transformed into 


l 270 


And now we have to verify the relation of orthogonality. Let 


+ = v=0,1,--- (Qn —1-—1). 
Since, for|z| = 1, 
2l 
we have by (14) 
_ Panti(2) Fan4i(2) 


Hence if in F'an42(z) the coefficient of 2’, 2‘, .-. , pe equals 0, (13b) is verified. 
But according to the definition of Fen42(z) 


(15) Fangi(z) = + dt + [ + dt. 


Here the first integral is a polynomial of z of degree (1 — 1); thus it has no 
influence upon our assertion. The second one we transform by the substitution / 8 
t = zw into ba 


1 
(1 — (1 + dw. 


Thus the second term is a polynomial the lowest term of which is 2n, which 
establishes the orthogonality. 
The minimum value is given by 


2n—l 21 l 


Fon4i(z) dz 
je|=1 


= 2Qnl X (the coefficient of in 


which by the form of Fen4:(z) in (15) equals 


Qn [ (1 — dw = Qn Cr) | | 
(7) 
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By this and (12) 


ie., by (11), 


2n — v 


log 3 


Since S 2and log (1 +2) 2 % (if 0 S S 2), we have, (by (16)), 
2n 
A(n) > | 19891 4 
y=2n—1+1V 


2 72 


4 } 
iog 3 (n log A(n)) 


which establishes the result. 


cos nd 


Nore I. For the Tchebysheff-polynomial 7',(x), where 7',(cos 3) = — 


cos" + --- , we have in [—1, +1] evidently | 7,,(z) | i.e. 7',(x) approxi- 


mates the function y = 0 in Tchebysheff’s sense, the error being less then .. 


By the above argument it can be seen that the function y = 0 is to be approxi- 
mated not essentially worse, in Bessel’s sense, by a polynomial of the form 
x” + ---, even when the polynomial has somewhere in [—1, +1] a root the 
multiplicity of which is less than [./n]. We are of the opinion that this very 
probably holds also for the Tchebysheff approximation; i.e. there exists a poly- 


nomial of degree n f(z) = x" + --- , which has somewhere in [—1, +1] a root 
of the multiplicity [./n], and yet in [—1, +1] 
| 2"f(z) | < B, 


where B is independent of n. By this it is clear that in general, the above 
theorem is not to be improved. 

Nore II. Let w(x) = 2" + --- be the polynomial of degree n minimizing for 
polynomials of degree n of the form f(z) = x" + .-- the integral 


(17) = [ |* p(x) da, 
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where k is a fixed positive number, p(x) is L-integrable, and in [—1, +1]p(x) = 
m(> 0). According to a theorem of Fejér all roots of w,(x) are in [—1, +1]. 
Denote the absolute maximum of | w,(x) | by M; if this maximum is taken at 


1 
= 2 and one of the intervals E on? Xo |, | to, % + suppose the 


latter, lies in [—1, +1], then : 
1 zotin-2 
| wn(x) |* p(x) dx > m | wn(ax) |* de. 
th M. 
But by Markoff’s theorem | w,(x) | > in| 4 | 
1 ay 1 

[ 
(18) dr > m( 

On the other hand, if f(z) = T,(cos 3) = = , we obtain by the minimum-prop- ’ 
erty of wn(x) 
| 1 ‘ 1 1 

(19) [ | | p(x) dx < p(x) dz, 


i.e. by (18) and (19), for [—1, +1] we have 


n 


$M < pat). 


, Thus by the theorem mentioned above we obtain for the roots of polynomials 
XI- minimizing the expressions in (17) that if the roots on the n* polynomials are 
, and = cos 0 < < x, then for any fixed sub- 
interval [a, 6] 


4 | 1 < o(p, log n) 
he ase ("<p 
Ty i.e., roughly speaking, the distribution of the roots of the minimizing poly- 
ly- nomials is uniformly dense. Analogous theorems are to be deduced for the 
vot polynomials solving extremum problems of the Tchebysheff-type. 
INSTITUTE FOR ADVANCED Stupy, Princeton, N. J. 
AND Bupapgst VI., HunGaARY. 
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SUR QUELQUES NOTIONS FONDAMENTALES DANS LA THEORIE 
GENERALE DES OPERATIONS LINEAIRES 


Par Riesz 


(Received April 8, 1939) 


, Le présent Mémoire a pour but de développer et de poursuivre quelques idées 

late tS concernant les opérations linéaires, idées que j’ai soulevées dans une conférence 

. faite en 1928, au congrés international des mathématiciens 4 Bologne et dont 

1 je me suis servi pour décomposer ces opérations en des parties appartenant 4 
| des types particuliers.’ La je me suis borné, a titre d’exemple, 4 des opérations 


linéaires portant sur des fonctions continues et j’ai montré, entre autres, que 
les parties en question de l’opération linéaire peuvent étre caractérisées et 
obtenues sans avoir recours & |’expression analytique de l’opération sous forme 
d’intégrale de Stieltjes. Comme je |’ai déja dit dans ma conférence, une telle 
le méthode a l’avantage d’étre entiérement générale de sorte que l’on peut s’en 
i servir pour les opérations linéaires des fonctions définies dans des ensembles 
abstraits, comme les a étudiées M. Daniell en généralisant la notion d’intégrale.” 
Les hypothéses dont nous partirons dans le présent Mémoire, sont encore plus 
générales que celles auxquelles j’ai pensé au temps de ma conférence; ce n’est 
pas 4 l’intervalle ou aux ensembles de points que nous substituerons des en- 
Bt Ge sembles abstraits, ni méme les fonctions continues remplacées par une classe 
ay plus générale de fonctions, mais ce sont les fonctions elles-mémes dont le réle 
| bE sera joué par des éléments abstraits, d’ailleurs soumis 4 quelques trés peu 
d’hypothéses concernant leur addition. 

Notre théorie est en relation intime avec celle des modules semiordonnés et 
des lattis linéaires, récemment développée par MM. Kantorovitch, Freudenthal 
et Garrett Birkhoff.* La différence la plus essentielle entre les deux théories 
concerne leurs points de départ, la théorie des lattis étant fondée sur des hy- 
pothéses plus exigeantes que celles posées pour notre domaine fondamental, 
c’est l’ensemble de nos éléments abstraits. Malgré cela, les opérations linéaires 
que l’on peut définir dans ce domaine forment ce qu’on appelle un lattis linéaire 
complet, c’est-A-dire que pour l’ensemble de ces opérations, les hypothéses de 
autre théorie s’y réalisent d’elles-mémes. Tel est aussi le cas pour le pro- 


1F. Riesz, “Sur la décomposition des opérations linéaires,’’? Atti del congresso inler- 
nazionale det matematici, Bologna 1928, 3, pp. 143-148. 

2P. J. Daniell, ‘‘The Integral and Its Generalizations,” Rice Institute Pamphlet, 8 
(1921), pp. 34-62; ‘‘A General Form of Integral,’ Annals of Math., (2) 29 (1917-18), pp. 
279-294 ; ‘‘Further Properties of the General Integral,” ibidem, (2) 31 (1919-20), pp. 203-220. 

3 Pour le présent travail cf. en premier lieu: H. Freudenthal, ‘“‘Teilweise geordnete 
Moduln,” Proceedings Acad. Amsterdam, 39 (1936), pp. 641-651. L’auteur date l’origine 
de la théorie en question de ma conférence de Bologne. 
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longement de notre domaine fondamental qui se fait par une sorte de dualité, 
en échangeant le réle des éléments et des opérations. En révanche, la théorie 
des lattis qui est encore plus abstraite, pourra nous aider d’examiner de plus 
proche les rapports mutuels qui existent, au point de vue formel, entre nos 
résultats concernant les opérations linéaires. 

C’est en 1936 que j’ai achevé et présenté a l’Académie hongroise une premiére 
rédaction, en langue hongroise, du présent Mémoire.‘ La rédaction actuelle en 
différe, 4 part quelques modifications légéres ici et 1a, principalement en ce 
qu’elle comprend un chapitre consacré au calcul fonctionnel des opérations 
linéaires, calcul permettant de composer des opérations données d’une maniére 
trés générale. Quelques compositions de type particulier, comme les produits, 
- les puissances et les moyennes géométriques sont traitées déja dans la premiére 
édition par une méthode entiérement différente qui n’est pas applicable au cas 
général. Aussi, pour épargner d’interrompre |’ordre d’idées des divers chapitres, 
nous avons intercalé quelques théorémes plutét auxiliaires. Enfin nous avions 
i refaire le dernier chapitre qui rend compte de quelques idées vagues con- 
cernant le prolongement du domaine fondamental et dans lequel se sont glissées 
quelques indications erronées. 

Observons encore, ce qui est d’ailleurs manifeste, que dans les considérations 
qui suivent, on aurait pu remplacer les opérations numériques par d’autres 
dont les valeurs appartiennent 4 n’importe quel lattis linéaire complet. 


I. Le domaine fondamental; exemples 


Nous allons envisager des opérations définies dans un ensemble abstrait que 
nous appelons le domaine fondamental et que nous désignons par 2. Les 
applications les plus importantes de notre théorie étant celles relatives 4 des 
opérations fonctionnelles, nous désignerons les éléments de Q en général par les 
lettres f et g. Nous supposons définie pour le domaine & une addition faisant 
correspondre aux éléments fi et fe un élément f; + fe bien déterminé, corre- 
spondance qui aura & satisfaire aux hypothéses suivantes. 

1. Elle est commutative et associative: fi + fo = fe thi (ith) th = 
fi + (fe + fs). 

2. La relation hi + fe = hi + fs implique que So = fs ° 

3. Ily aun élément 0, tel que f + 0 = f pour tous les f et la relation f, + fe = 0 
implique que fi = fo = 0. 

4. Lorsque fi + fa = gi + ge, il existe quatre éléments fu , fiz , far , fox de sorte 
que fu + fis = fr, for + for = fo, fu + far = gi, + for = ge. 

Voici une forme plus générale de cette derniére hypothése qui s’en déduit par 
récurrence, ayant égard encore 4 l’hypothése 1. FEtant donnés des éléments f; , 
Gk (i 1,---, mk 1,---,) tels que 


= Gey 


‘Imprimée dans les comptes rendus de la Classe des Sciences Matematikai és Termé- 
szettudomdnyi Ertesité, 66 (1937), pp. 1-46. 
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il existe des éléments fix de sorte que 


fi = fi; fu = 


Considérons quelques exemples. Les plus simples sont l’ensemble des 
nombres réels non-négatifs, celui des entiers non-négatifs et celui des logarithmes 
des entiers positifs ou ce qui revient au méme, l’ensemble des entiers positifs, 
mais en remplagant |’addition par la multiplication et le zéro par l’unité. Outre 
ces exemples évidents, en voici quelques autres, plus intéressants au point de 
vue de la théorie que nous allons développer. Dans tous ces exemples, l’addition 
sera définie par l’addition numérique. Les trois premiéres hypothéses étant 
évidemment remplies, il ne s’agira que de vérifier l’hypothése 4. 

EXeEmMPLE 1. Les fonctions non-négatives et continues définies dans |’inter- 
vallea Sb. 

Lorsque fi + fe = 9:1 + ge = f, la décomposition exigée par l’hypothése 4 est 
fournie par 

0 quand f(z) = 0. 
Les relations exigées étant évidentes, de méme que la continuité des fonctions 
fix aux points x ot f(z) ne s’annule pas, nous n’avons qu’a examiner |’allure de 
ces fonctions aux zéros de la fonction f(x). Or soit x un tel point; alors f;(x0) = 0 
et comme d’autre part g(x) S f(z), il vient que f(x) S fi(x) et par conséquent 
fix(x) s’annule au point x d’une fagon continue. 
Voici encore un second procédé de décomposition. Posons 


= min (fi(z), 
fe=fi-fu, fa=g—fu, fo =f — max (f(z), g(z)).’ 
Alors il est manifeste que 
Jutfe=fi, futfi=n 
et les deux autres relations s’obtiennent en observant que l’on a toujours 
min (x, y) + max (z,y) = x+y. 


ExXEMPLE 2. Les fonctions rationnelles f(x) = p(x)/q(x) formées de poly- 
nomes p(x) = 0, g(x) > O(a Sz SD). 
La décomposition répondant a l’hypothése 4 s’effectue par le premier procédé 


5 min (a, y) et max (x, y) désignent respectivement la plus petite et la plus grande des 
valeurs xz, y; nous nous servirons de la méme notation pour désigner les bornes inférieures 
et supérieures des ensembles numériques finis ou infinis et cela en réservant les notations 
inf et sup pour quelques autres catégories d’ensembles. 
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appliqué dans l’exemple 1. Le second procédé est en défaut, les fonctions des 
types min (f, g) et max (f, g) n’appartenant pas en général au domaine envisagé. 

Exemp.e 3. Les fonctions esealier non-négatives définies dans l’intervalle 

ExrempLe 4. Les mémes fonctions, mais en convenant de considérer comme 
identiques deux fonctions quand elles ne différent que pour un nombre fini de 
valeurs de la variable. 

Dans ces deux exemples, la décomposition exigée par l’hypothése 4 s’effectue 
par l’un ou l’autre des procédés dont nous nous sommes servis dans |’exemple 1. 

Exemp.e 5. L’ensemble des fonctions harmoniques non-négatives, définies 
dans une région ouverte du plan ou de |’espace. 

La décomposition s’effectue par le second procédé, mais en substituant aux 
enveloppes min (fi , g:) et max (fi , g:) qui en général ne seront pas harmoniques, 
les deux fonctions harmoniques que nous convenons de désigner par ¢ = 
inf (fi, g:) et Y = sup (fi, g:) et dont la premiére est la plus grande parmi les 
fonctions harmoniques ne surpassant ni f; ni g: et la seconde se définit d’une 
maniére analogue. Ces deux fonctions peuvent étre construites par exemple 
en appliquant la méthode de balayage de Poincaré aux fonctions min (fi, g:) 
et max (f; , g:) dont la premiére est surharmonique, la seconde sousharmonique. 
La relation g + ¥ = fi + gi est une conséquence immédiate de la relation 
min (f1, g:) + max (fi, 91) = fi + gi, le balayage étant distributif et ne chan- 
geant pas le second membre de la derniére relation. Il convient d’ailleurs 
d’observer que la relation g + y = fi + gi se déduit aussi des propriétés extré- 
males des fonctionsgety. Eneffet, commeg S fiet S$ g1,onafitg—¢ 2 fi 
et 2 gi, donc fi + gi — ¢ 2 sup (fi, gi) = ¥, ce qui donne 


(1) fitn2zety. 
Linégalité opposée 
(2) 


s’obtient en partant de y = fi et = gi, ce qui s’écrit aussi fi + g. — ¥ SH 
et fi; il s’ensuit que fi + gi — S inf (fi, = ¢, ¢.-a-d. l’inégalité (2). 
La comparaison des inégalités (1) et (2) donne f; + gi: = ¢ + ¥, ce qu'il fallait 
prouver. 

Ajoutons que ce dernier raisonnement est valable aussi pour un domaine 2 
abstrait. En effet, convenons d’écrire fi = fe, fe S fi quand fe = fi — fs 
c’est-d-dire que fi = fo + fs et de dire briévement que fi est supérieur fe et fe 
est inférieur 4 f; et de parler conformément 4 ces conventions des éléments 
le plus grand et le plus petit d’un ensemble; alors notre raisonnement s’applique 
et fournit l’énoneé suivant. 

Etant donnés deux éléments f et g du domaine Q, supposons que parmi les élé- 
ments qui sont inférieurs d tous les deux il y en a un élément ¢ qui est le plus grand 
et que parmi les éléments supérieurs a f et g il y en a un, soit p, qui est le plus 
petit, alors 

etyv=frtg. 
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Si de plus nous désignons par 4’ l’hypothése que nous venons de formuler, 
supposée remplie pour tous les f et g, alors l’hypothése 4 est une conséquence 
des hypothéses 1-3 et 4’. Au contraire l’hypothése 4’ n’est pas une conséquence 
des hypothéses 1-4, comme le montre |’exemple 2. 

Exemp.e 6. L’ensemble des distributions de masses positives dans l’espace, 
données par exemple sous forme de fonctions additives d’ensemble yu(e) = 0 
pour tous les ensembles e ouverts et bornés; on suppose que u(e) satisfait aux 
hypothéses suivantes: 1) u(e’) < mu(e) lorsque e’ est compris dans ¢; 2) 
+ w(er + = + 3) peut étre approché indéfiniment 
par y(e’), e’ parcourant les ensembles ouverts compris dans e de méme que 
leurs frontiéres. 

L’hypothése 4 est satisfaite aussi pour ce dernier exemple, les distributions 
inf (u, v) et sup (u, v) correspondant aux distributions u(e) et v(e) étant définies, 
comme on sait, la premiére par la borne inférieure de y(e1) + v(e2), les ensembles 
ouverts €; , é2 variant sous la condition e; + e: = e, et la seconde par la borne 
supérieure de u(es) + v(es) oi es et e4 , sousensembles ouverts de e, varient sous 
la condition de ne pas s’empiéter. 

ExEeMpPLE 7. Tout systéme de fonctions f(x) non-négatives définies dans un 
ensemble abstrait, comprenant la fonction zéro et avec les fonctions f(x), g(z), 
leur somme f(x) + g(x) et leurs enveloppes min (f(x), g(x)) et max (f(z), g(z)). 
C’est 4 de tels systémes que s’applique la théorie générale de |’intégration due 
& M. Daniell. 

On arrive 4 d’autres exemples par des modifications évidentes des exemples 
que nous venons d’énumérer. Ainsi dans l’exemple 6, on pourrait partir, au 
lieu de l’espace entier, d’un domaine borné de cet espace ou d’une surface 
fermée ou d’un intervalle de droite et pour parler aussi de l’autre extréme, d’un 
espace & une infinité de dimensions ou encore d’un espace abstrait. Dans 
l’exemple 7 on pourrait convenir, comme nous |’avons déja fait dans l’exemple 4, 
de considérer comme identiques deux fonctions qui ne différent que pour un 
nombre fini de points z. Observons qu’un cas particulier important de 
exemple 7 est fourni par l’ensemble des fonctions non-négatives presque 
périodiques. Nous pourrions encore ranger dans cet ordre d’idées certaines 
variétés de transformations linéaires de l’espace de Hilbert, variétés qui ont . 
fait objet de travaux importants il y a peu d’années. 


II. Opérations linéaires positives; la plus grande minorante et la plus petit 
majorante 


Dans ce qui suit, nous allons considérer des opérations linéaires définies dans 
le domaine 2; cependant, jusqu’a la fin du chapitre V, il ne sera question que 
des opérations linéaires positives. Par l4 nous entendons les opérations A = 
A(f) = Af faisant correspondre 4 chaque élément f une valeur non-négative Af 
de sorte que A(f; + fe) = Afi + Afe. 

Lorsque, pour deux opérations, A; et Az, on a Aif < Asf pour tous les f, 
nous convenons d’écrire aussi A; S Az et A, = A; et de dire que A; est une 
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minorante de Az et que As est une majorante de A,. Un ensemble {A} d’opéra- 
tions A, en nombre fini ou infini, dénombrable ou non, sera dit majorable lorsqu’il 
existe une opération B telle que A < B pour tous les A. Les opérations B 
du type envisagé seront dites majorantes de l’ensemble {A}. D’une maniére 
analogue, lorsque B < A pour tous les éléments de l'ensemble {A}, on dira 
que B est une des minorantes de l’ensemble. Tout ensemble d’opérations 
linéaires positives est minorable, dans un sens évident, une minorante étant 
toujours fournie par l’opération 0 qui s’annule pour tous les f. 

Convenons encore de dire que la majorante B est la plus petite majorante 
de l’ensemble {A} et de la désigner par sup {A} lorsque’elle est une minorante 
de toutes les autres majorantes, ainsi que de définir d’une fagon analogue la 
plus grande minorante et de la désigner par inf {A}. 

TutoriME 1. Chaque ensemble {A} d’opérations linéaires positives admet une 
plus grande minorante, et chaque ensemble {A} majorable admet aussi une plus 
petite majorante. 

Démontrons d’abord la premiére assertion. A cet effet, nous envisageons 
toutes les sommes de la forme 


(1) Aifi 
ou m est arbitraire, ot les A; sont tirés arbitrairement de l’ensemble {A } et ot les 


f= 
sont des décompositions de l’élément donné f. Variée sous ces conditions, 
l’expression (1) admet une borne inférieure Bf et cela pour tous les f. Je dis 
que cette borne Bf, envisagée comme fonction de f, définit l’opération inf {A} 
exigée par notre théoréme. Tout d’abord, il est manifeste que Bf = 0 et plus 
généralement, comme 


Afi = = Of 


pour toute minorante C de l’ensemble {A}, on a aussi Bf 2 Cf pour toutes ces 
minorantes. D’autre part on a Bf < Af pour toute opération A; il suffit de 
poserm = 1, A; = A, f; =f. Donc il nous reste seulement 4 vérifier la relation 


(2) Bf + g) = Bf + Bg. 


A cet effet, observons d’une part que, toute décomposition f = rs ’ 
g= 23 gi donnant lieu 4 une décomposition de f + g en Lrf t+ Lig, 
on a immédiatement que 


(3) B(f +9) S Bf + Bg. 
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D’autre part, soit f + g = >? fi une décomposition de f + g; alors, par V’hy- 
pothése 4 faite concernant le domaine Q, il existe, comme nous I|’avons observé, 


des décompositions 


telles que fi + gi =fi(i=1,---,p). Or, B(f +g) étant la borne inférieure de 
Aifi = Ash + Avg = Bf + Bg 


il faut que 
+ 9) 2 Bf + Bg 


ce qui, comparé avec (3), donne la relation (2) qu’il nous fallait prouver. 

Quant 4 l’existence de la plus grande majorante, ce probléme se raméne au 
précédant et cela en considérant l’ensemble {Do — A} ot A parcourt l’ensemble 
{A} et Do désigne n’importe quelle majorante de ce dernier. Je dis que 
Vopération 


(4) E = Dy — inf {Dy — A} 
fournit l’opération sup {A}. En effet, on a évidemment 


pour tous les A; d’autre part, D, désignant l’une quelconque des majorantes 
de l’ensemble {A}, l’opération D, = inf (Do, D1) sera une majorante du méme 
ensemble et par conséquent, 


E = Do — inf {Dp — A} S Do — (Do — D2) = D2 SD 


c’est-a-dire que E est bien la plus petite parmi les majorantes de l’ensemble {A}. 

La construction de l’opération sup {A} = E montre aussi immédiatement 
que Ef est la borne supérieure des valeurs de (1). 

Voici encore deux relations, conséquences immédiates des raisonnement qui 
précédent; comme nous aurons 4 nous y reporter 4 plusieurs reprises, nous les 
énoncons sous forme de théorémes. 

THEOREME 2. Pour deux opérations linéaires positives A, , Az on a toujours 


(5) inf (Ay Ag) + sup (A, Ag) = Aj + Ae 


Tutorbme 3. Etant donné wn ensemble {A} d’opérations linéaires positives 
A et en désignant par {A + Bo} l’ensemble obtenu en ajoutant a chaque A Vopéra- 
tion linéaire positive Bo , toujours la méme, on a 


inf {A + Bo} = inf {A} + Bo 
et quand Vensemble {A} est majorable, on a aussi 
sup {A + Bo} = sup {A} + Bo. 
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D’une facon plus générale, lorsque {A} et {B} sont deux ensembles d’opérations 
linéaires positives et {A + B} l'ensemble de toutes les opérations A + B que l’on 


en peut composer, alors on a 

(6) inf {A + B} = inf {A} + inf {B} 

et quand les deux ensembles sont supposés majorables, on a encore 
sup {A + B} = sup {A} + sup {B}. 


Le théoréme 2 se déduit immédiatement de la formule (4) en y posant Dy = 
A, + Ae, ce qui donne la relation 


sup (Ai, As) = = A; + Az — inf (Ae, Ai), 


équivalente 4 la relation (5). 

La premiére partie du théoréme 3 se déduit du fait que les opérations 
inf {A + Bo} et sup {A + Bo} se calculent comme inf {A} et sup {A}, en 
remplagant l’expression (1) par 


(A; + Bof; = + Bof. 


’ La seconde partie se raméne & la premiére en partant du fait évident que 
inf {A + B} S inf {A + Bo} et inf {A} <S Ao, inf {B} < By pour chaque 
choix particulier des opérations Ao, Bo appartenant respectivement aux en- 
sembles {A}, {B}. En effet, Vinégalité 


inf {A + B} — inf {A} <S inf {A + B} — inf {A} = By, 
vraie pour toute opération By tirée de {B}, nous donne 
(7) inf {A + B} — inf {A} S inf {B}. 
D’autre part on a 
inf {A} + inf {B} S Ao + Bo 
pour tout choix pastioadlan des opérations Ao, Bo et cela donne 
(8) inf {A} + inf {B} S inf {A + B}. 


La comparaison des inégalités (7) et (8) fournit la relation (6) et un raisonne- 
ment analogue s’applique au cas des plus petites majorantes. 

Notons que l’on aurait pu aussi déduire les théorémes 2 et 3 de l’énoneé du 
théoréme 1, sans faire appel aux détails de la démonstration. 

Voici encore un corollaire du théoréme 3. 

Pour trois opérations linéaires positives A, B, C on a toujours 


(9) inf (A, B + C) S inf (A, B) + inf (A, C). 
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En effet, en remplagant les ensembles {A}, {B} par les couples A, B et A, C, 
la relation (6) s’écrit 


inf (2A, A + B, A +C,B+C) = inf (A, B) + inf (A, C) 


et en observant que A S$ 2A,A SA+BetA SA + C et que, par consé- 
quent, 


inf (A,B + C) S inf (2A,A + B,A+C,B+0), 


il en découle l’inégalité (9). 

Disons encore quelques mots sur certains cas particuliers du théoréme 1. Etant 
donné l’ensemble {A}, envisageons, pour chaque élément f, la borne inférieure 
des valeurs Af que nous désignerons par min {Af} pour la distinguer de inf { A}f, 
valeur que prend |’opération inf {A} pour l’élément f. Ces deux opéra- 
tions ne coincident que si la premiére est distributive, c’est-d-dire que si 
min {A(f + g)} = min {Af} + min {Ag}. Elles coincident, entre autres, 
lorsque l’ensemble {A} comprend, pour tout couple A; , As de ses éléments, au 
moins une minorante A Ai, A S Az. En effet, on voit immédiatement que 
sous cette hypothése il y a aussi, dans l’ensemble {A}, une minorante Ao pour 
tout n-uple Ai, --- , A, tiré de l’ensemble; par conséquent on a 


Aafi = Z min (Af), 


et notre assertion s’en déduit immédiatement. On a donc le 
TuHtoriMe 4. Lorsque l’ensemble {A} comprend, pour chaque couple Ai, As 
de ces éléments, un élément A = Ai, A S Az, alors on a, pour tous les f, 


inf {A}f = min {Af}. 


Evidemment, un fait analogue est valable pour les majorantes. Notons en 
particulier le cas ot les éléments A dépendent d’un indice ou d’un parameétre 
continu  allant vers l’infini et A, est monotone par rapport 4 c’est-d-dire 
que l’on a, pour \ < y, ou bien constamment A, S A, ou bien l’inégalité inverse. 
Dans ces cas, au lieu d’écrire inf {A} et sup {A}, nous pourrons nous servir 
aussi de l’écriture évidente lim A,, A, — lim A, etc. Convenons encore 
d’écrire B = 2, A; dans le cas ot les opérations positives A; sont telles que la 
série Bf = Af est convergente pour tous les f; plus généralement, nous 
écrirons B = >> A aussi lorsque l’ensemble {A} qui peut étre infini non dénom- 
brable, est tel que, pour chaque f, il n’y a qu’un nombre fini ou une infinité 
dénombrable d’opérations A pour lesquelles Af ne s’annule pas et que de plus 
la série formée de ces valeurs Af est convergente. | 


III. Opérations disjointes 

Les opérations linéaires positives A et B seront dites d’étre disjointes et nous 
écrirons A || B ou B|| A, lorsque inf (A, B) = 0, c’est-a-dire que A et B 
n’admettent aucune minorante outre 0. D’aprés le théoréme 2, cette hypothése 
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s’écrit aussi sup (A, B) = A + B. La construction de la plus grande minorante 
donnée au chapitre précédent met en évidence le critére suivant. Pour que 
A || B, il faut et il suffit que pour chaque élément f et pour chaque « > 0 il existe 
une décomposition f = fi + fe, telle que 


Afi S «, Bfe «. 


I] suit immédiatement de la définition que si A || B et Ai S A, B, S B,ona 
aussi A; || B:. Une autre conséquence immédiate consiste en ce que A || B 
et a = 0, b = 0 impliquent que aA || bB. En effet, en posant max (a, b) = c, 
on a 


inf (aA, bB) S inf (cA, cB) = c inf (A, B) = 0. 


TuforEME 5. Lorsque A || Bet A B+ C, onaaussiA < C. 
En effet, de A S B+ Cet B < B+ C on déduit que 


A +B = sup (A, B) 


done A S C. 
Tatorime 6. Lorsque Ax || B(k = 1, --- ,n), alors on a aussi Ax || B. 
Il suffit de considérer le cas n = 2, auquel le cas général se raméne par ré- 
currence. Posons done C = inf (A; + Az, B);il nous faut montrer que C = 0. 
Or C < B\| Ai, done C || A; et comme C S A; + Az, le théoréme 5 fournit 
que C S A,. Par conséquent 


C &S inf (Az, B) = 0. 


Tutorime 7. Lorsque l'ensemble {A} est majorable et que lon a A || B pour 
une opération B et pour tous les A, alors on a aussi sup {A} || B. 
Ce théoréme est aussi un corollaire du théoréme 5. Posons, en effet, 


inf (sup {A}, B) = D; 
alors on aura, pour tous les A, 
A S sup {A} = D + (sup {A} — D); 


comme, d’autre part, D < B|| A entraine que A || D, on obtient du théoréme 
5 que 
A S sup {A} — D 


pour tous les A, ce qui donne D = 0. 

Observons que le théoréme 6 s’étend d’une maniére évidente au cas d’une infinité 
@opérations A, , méme au cas d’une infinité non dénombrable d’opérations A 
dont nous avons parlé a la fin du chapitre précédent; bien entendu on aura a 
supposer la convergence de >, A. En effet, > A nest que la plus petite 
majorante de l'ensemble des opérations > 1 A; , formées de toutes les maniéres 
possibles d’éléments A; tirés de l’ensemble {A}, et notre assertion se déduit 
de ce fait par les théorémes 6 et 7. Un cas particulier important est celui ot 
ensemble {A}, supposé majorable, est formé d’opérations A disjointes deux a 
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deux; dans ce cas on a évidemment >> A = sup {A} et notre assertion est 


comprise dans le théoréme 7. 
TutorbmMe 8. Pour deux opérations linéaires positives A, B, d’ailleurs quel- 


conques, on a 
sup (A, B) — A = B — inf (A, B) || A — inf (A, B) = sup (A, B) — 
Pour le voir, posons C = inf (Ai, B:) avec A: = A — inf (A, B), B = 
) — inf (A, B); nous avons 4 montrer que C = 0. Or, comme A; 2 C, B, 2 C, 
| nous obtenons 


| A = inf (A, B) + A; 2 inf (A, B) + C, 
Bp B = inf (A, B) + B, 2 inf (A, B) + C, 


done 


inf (A, B) = inf (A, B) + C 


if et de la C = 0. 
Tutorime 9. Lorsque B || C, on a pour tout A 


q inf (A, B + C) = inf (A, B) + inf (A, C). 


f | En effet, l’hypothése faite assure que inf (A, B) || inf (A, C); de 1a et des 
relations évidentes 


inf (A, B + C) 2 inf (A, B), 
on obtient 
inf (A, B + C) = sup (inf (A, B), inf (A, C)) = inf (A, B) + inf (A, C) 


et la démonstration s’achéve en comparant cette inégalité avec l’inégalité (9) du 
chapitre précédent. 


IV. Parties disjointes des opérations linéaires 


A et B étant des opérations linéaires positives telles que A < Bet A || B — A, 
nous convenons de dire que A est une partie disjointe de B et d’écrire A < B. 
Il est manifeste qu’on aura en méme temps B — A < B. 

Tutorime 10. Lorsque A < B < C, ona aussi A < C. 

En effet, comme B || C — Bet A < B, on aura aussi A Il C — B; comme 
de plus A || B — A, on déduit du théoréme 6 que 


A||(C B)+(B-A)=C-A 


c’est-a-dire que A < C. 

THEOREME 11. Lorsque A B Cet A < C,onaaussiA < B. 

En effet, d’aprés les hypothéses faites, on a A ||C — Aet B— ASC-A 
et par conséquent A || B — A, c’est-d-dire que A < B. 

Tutorime 12. Lorsque A < B pour tous les éléments A de l’ensemble {A}, 
alors inf {A} < Bet sup {A} < B. 


inf (A, B + C) 2 inf (A, C) 
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Posons inf {A} = C, sup {A} = D. Alors par l’hypothése faite C < 
A || B — A pour tout A, done C || B — A et par le théoréme 7 il s’ensuit que 


C || sup {B — A} = B — inf {A} = B-C, 


c’est-d-dire que inf {A} = C < B. Pour obtenir de 1a la seconde assertion du 
théoréme, on n’a qu’a appliquer le résultat obtenu 4 l’ensemble {B — A}; en 


effet, 
B — D=B — sup {A} = inf {B — A} < B, 


done aussi D < B. 

TuftorkME 13. Lorsque A # 0, A < B et que a et b sont des coefficients nu- 
mériques non-négatifs tels que aA bB, alors on a aussi a b. 

‘En effet, de A || B — A il s’ensuit que aA || b(B — A); comme de plus 
b(B — A) S bB et aA S bB, on obtient que 


aA + b(B — A) = sup (aA, b(B — A)) S bB 


c’est-a-dire que aA S bA. Or, soit f un élément tel que Af ¥ 0, done Af > 0, 
alors aAf S bAf donne que a S b, c. Q. F. D. 


V. Familles complétes et familles presque complétes 


Un ensemble d’opérations linéaires positives sera dit de former une famille 
complete lorsqu’il comprend 

1) toutes les minorantes de chacun de ses éléments, 

2) la somme de deux quelconques de ses éléments et 

3) les plus petites majorantes de ses sous-ensembles majorables. 

Voici quelques exemples: a) l’ensemble de toutes les opérations linéaires 
positives d’un domaine fondamental Q; b) l’ensemble de toutes les opérations 
qui s’annulent pour certains éléments f donnés, en nombre fini ou infini; c) l’en- 
semble des opérations qui sont disjointes 4 certaines opérations données, en 
nombre fini ou infini. L’exemple a) est évident et les hypothéses 1) et 2) sont 
aussi évidemment réalisées par l’exemple b); pour vérifier dans ce cas l’hy- 
pothése 3), nous n’avons qu’A rappeler le procédé par lequel on construit la 
plus petite majorante. Quant a l’exemple c), l’hypothése 1) est une consé- 
quence immédiate de la définition des opérations disjointes et les hypothéses 
2) et 3) se vérifient par les théorémes 6 et 7. 

THtoREME 14. Lorsque l'ensemble {A} forme une famille complete, alors toute 
opération linéaire positive B se décompose, d’une facgon unique, en deux parties 
disjointes A, et B, = B — A, ot A; appartient a la famille {A} et B, || A pour 
tous les A. Les opérations A; , B, ne sont que les plus grandes parties disjointes 
de Vopération B, la premiere parmi celles qui appartiennent a la famille {A}, 
la seconde parmi celles qui sont disjointes a tous les A. 

Pour démontrer ce théoréme, désignons par A; la plus petite majorante de 
ceux des éléments A de l’ensemble {A} pour lesquels A < B, et posons B, = 
B— A,. Alors, par ’hypothése 3), A: appartient A l’ensemble {A} et il nous 
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faut encore montrer que B, || A pour tous les A. Or, par les hypothéses 1) 
et 2), les opérations 


A, + inf (A, B) 
appartiennent aussi 4 l’ensemble {A}; comme de plus 
A, + inf (A, Bi) A1+B = 8B 
alors, de par la définition de Au, 
A; + inf (A, Bi) S Ai 


et par conséquent, inf (A, Bi) = 0, c’est-d-dire que B; || A, ce qu’il fallait 
démontrer. 

L’unicité s’ensuit immédiatement du théoréme 5. 

La derniére assertion du théoréme vient immédiatement de notre démonstra- 
tion en ce qu’elle concerne l’opération Ai. Quant a l’opération B,, soit B, 
une partie disjointe de B (ou plus généralement soit B, < B) et supposons que 
B, || A pour tous les A. Alors comme Bz, S B = A; + B, et Be || Ax, il suit du 
théoréme 5 que B, S B, et comme encore B, est elle-méme une partie disjointe 
de B, notre assertion est démontrée. 

La décomposition d’une opération linéaire portant sur les fonctions continues 
en des parties appartenant 4 trois classes particuliéres importantes dont j’ai 
parlé dans ma conférence de Bologne, est une des applications les plus carac- 
téristiques du théoréme 14. 

Tutorime 15. Etant donné Vensemble {A}, désignons par A’ les opérations 
qut sont disjointes a toutes les opérations A et par A” celles disjointes d toutes 
les A’. Alors les ensembles {A'} et {A’’} forment des familles completes et {A"’} 
est la plus petite famille complete embrassant l'ensemble {A}. 

Observons tout d’abord que l’existence d’une plus petite parmi les familles 
complétes embrassant l’ensemble {A} est évidente, cette plus petite famille 
n’étant que la partie commune de toutes ces familles complétes. 

Quant a l’ensemble {A’}, pour voir qu’il forme une famille compléte, nous 
n’avons qu’a rappeler notre exemple c); il en est de méme pour l’ensemble {A’’}. 
Quant 4 la derniére assertion du théoréme, soit [A] la plus petite famille com- 
pléte embrassant l’ensemble {A}; il faut montrer que {A’’} = [A]. D/’abord il 
est manifeste que la famille {A’’} comprend l’ensemble {A}; ce qui reste 4 
prouver c’est que toute opération A’’ est comprise dans la famille [A]. A cet 
effet, soit A; l’une quelconque des opérations A” et mettons-la, conformément 
au théoréme 14, sous la forme Aj = A, + B,, A; appartenant A la famille [A] 
et B, étant disjoint 4 tous les éléments de cette famille, donc & plus forte raison 
& tous les éléments de l’ensemble {A}, c’est-d-dire que B; est un A’. Comme 
encore B, < Aj, alors B, appartient aussi A la famille {A’’}. Par conséquent 
B, || Bi, done By, = 0, c’est-A-dire que Aj = A, et ainsi Aj’ appartient A la 
famille [A], ce qu’il fallait démontrer. 

Observons encore que lorsque l’ensemble {A} forme lui-méme une famille 
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complete et que par conséquent {A} = [A], notre théoréme se réduit au corollaire 
suivant. A toute famille compléte {A} il correspond une famille complémentaire 
{A’} bien déterminée, formée par les opérations disjointes a toutes les opérations A, 
et la famille complementatre qui correspond a la famille {A'} est fournie par la 
famille primitive {A} elle~méme. 

Voici encore un second corollaire, énoncé sous une forme particuliére dont 
nous aurons besoin dans la suite. Lorsque A || B et que [A] et [B] désignent les 
plus petites familles completes embrassant respectivement les opérations A et B, 
alors [A] || [B], ce qut veut dire que tout élément de la famille [A] est disjoint a 
tout élément de la famille [B}. 

Ici peut-étre convient-il d’observer que la plus petite famille compléte [A] 
déterminée par une seule opération donnée A peut étre trés peuplée. Voici un 
des exemples les plus caractéristiques, cas particulier de l’exemple 7 dont nous 
avons parlé au chapitre I. Soit 2 l’ensemble des suites x = (zx) formées de 
termes non-négatifs, 4 somme finie, l’addition de deux suites se faisant comme 
d’ordinaire. On voit aisément que les opérations linéaires positives A dans 2 
s’expriment sous la forme Az = > axx, od (ax) est une suite bornée A termes 
positifs, déterminée par l’opération A et que, inversement, toute telle suite (a,) 
donne naissance 4 une opération A. Aussi est-il évident que la plus petite 
minorante des opérations A = (a,) et B = (bk) correspond a la suite 
(min (a, , b,)) et que, par conséquent, l’opération A; = x, pour laquelle 
a, = --- = 1 n’est disjointe 4 aucune des opérations sauf 0. C’est-a-dire que 
dans le cas envisagé la plus petite famille compléte [A1] qui embrasse A; , épuise 
toutes les opérations possibles. Observons encore que pour le domaine fonda- 
mental Q figurant dans |’exemple 1, formé par les fonctions continues non- 
négatives f(x) définies dans l’intervalle a < x < b, la plus petite famille com- 
pléte [J] embrassant l’intégration 


n’est que l’ensemble des opérations 
(2) [ g(x) f(x) da 


correspondant 4 des fonctions g(x) non-négatives et intégrables au sens de 
Lebesgue, d’ailleurs quelconques. 

Disons maintenant quelques mots des familles presque complétes que nous 
définissons en supprimant l’hypothése 3). L’ensemble {A} sera done dit de 
former une famille presque compléte lorsqu’il comprend 

1) toutes les minorantes de chacun de ses éléments et 

2) la somme de deux queleconques de ses éléments. 

Une famille presque compléte, de méme que les familles complétes, contient 
la plus petite majorante d’un nombre fini de ses éléments, Ai, --- , An, ce qui 
suit de la définition en observant que cette majorante est une minorante de la 
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somme A, + --. + A,, mais ce fait ne subsiste plus nécéssairement pour une 
infinité d’éléments. 

L’existence d’une plus petite famille presque compléte embrassant l’ensemble 
donné {A} est manifeste et cela pour les mémes raisons que pour la famille 


compléte. De plus l’hypothése 2) met en évidence que cette famille devra 


comprendre toutes les opérations de la forme 
(3) aA; +--+ + GnAn 


ot A;, --- , An sont des opérations appartenant 4 l’ensemble {A} et a, --- , a, 
désignent des nombres entiers positifs. D’autre part, d’aprés l’hypothése 1), 
notre famille devra aussi comprendre toutes les minorantes des opérations (38). 
Mais il est évident que l’ensemble de ces minorantes satisfait aux hypothéses 1) 
et 2), c’est-A-dire qu’il fournit précisément la plus petite famille en question. 

En particulier, la plus petite famille presque compléte embrassant une opéra- 
tion donnée A se compose de toutes les minorantes de A et de leurs multiples 
ou ce qui revient au méme, elle n’est que l’ensemble des opérations B bornées 
par rapport 4. A,B < cA. Ainsi par exemple, pour l’opération J donnée par la 
formule (1), ¢’est-a-dire l’intégration, la plus petite famille presque compléte 
qui l’embrasse est formée par celles des opérations (2) qui correspondent 4 des 
fonctions g(x) bornées. 

Convenons encore d’appeler, dans la suite, famille de l’opération A la plus 
petite famille compléte qui l’embrasse. Nous venons de voir que cette famille, 
et méme la plus petite famille presque compléte embrassant une opération, 
peuvent engendrer toutes les opérations du domaine envisagé. Parlons main- 
tenant de l’autre possibilité extréme. Supposons que |’opération A n’admette 
pas d’autres minorantes que celles de la forme cA (0 S ¢ S 1), alors il est 
manifeste que les deux familles en question se bornent aux mémes opérations cA ; 
bien entendu, c varie de 0 a l’infini. Tel est le cas dans nos exemples 1)-3) 
pour l’opération A définie par Af = cf(a) ou plus généralement par Af = ef(2%) 
(c 2 0). Quant 4 l’exemple 4), celui-ci ne donne pas lieu 4 de telles opérations 
“‘singuliéres.”’ 

Enfin observons que l’on peut passer, comme on montre immédiatement, de 
la famille presque compléte déterminée par une opération A A sa famille [A], et 
cela en y ajoutant les plus petites majorantes des sous-ensembles majorables. 
Il en est de méme pour les plus petites familles compléte et presque compléte 
qui embrassent un ensemble {A} d’opérations données. 


VI. Opérations linéaires de signe quelconque 


Passons aux opérations de la forme A = B — C ot B et C sont des opérations 
linéaires positives mais sans que l’on ait nécessairement C < B. Alors l’opéra- 
tion A ne sera pas nécessairement positive; convenons de |’appeler tout simple- 
ment opération linéaire. 

Pour qu’une opération A appartienne 4 la classe envisagée, il faut et il suffit 
qu’elle soit distributive, A(f + g) = Af + Ag, et que de plus elle soit majorable 
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par une opération linéaire positive, c’est-d-dire qu’il existe une opération linéaire 
positive B de sorte que l’on ait Af < Bf pour tous les f. La condition est 


évidente. 
L’opération A = B — Cod B 2 0,C 2 0 peut s’écrire sous la forme canonique 


A = B, — C,, en posant 
B, = B — inf (B, C) = sup (B, C) — C, 
C; = C — inf (B, C) = sup (B, C) — B; 


les opérations B, , C; s’appelleront respectivement les parties positive et négative de 
A et nous écrirons aussi pos A au lieu de B, et neg A au lieude C,. Je dis que 
ces opérations B, et C; sont disjointes et qu’il n’existe pas d’autre décomposition 
A = B, — C2, Bz 2 0, Ce 2 O avec Bz, C2 disjoints. En effet, la relation 


(1) 


B, || C, est assurée par le théoréme 8, et si l’on avait encore By || Cz , alors comme 
(2) Bi, + = B+ Cy 


et par conséquent B, < By + C;, il suivrait du théoréme 5 que B, S By, et par 
symétrie que Bz S B,, done B, = Bz et alors, par (2), C) = C2. 

Il s’ensuit que les opérations B, = pos A, C; = neg A sont déterminées par 
l’opération A elle-méme, indépendamment de la décomposition A = B — C 
dont nous sommes partis. De 1a il vient encore par (1) que pos A S B, 
neg A S C pour toute telle décomposition et comme A = pos A — neg A est 


elle-méme une telle décomposition, alors 
pos A = inf {B}, neg A = inf {C}, 


formés avec les B et C qui figurent dans l’ensemble des décompositions possibles. 

Les notions de minorante et de majorante, de plus grande minorante et de 
plus petite majorante s’étendent d’une maniére évidente aux opérations linéaires 
de signe quelconque; cela étant, on pourra aussi écrire 


pos A = sup (A, 0); neg A = sup (—A, 0). 


En effet, toute majorante B des opérations A et 0 ou ce qui revient au méme, 
toute majorante positive B de l’opération A, fournit une décomposition A = 
B-—C,B=0,C = B—A = Oet de méme, toute majorante positive C de —A 
donne lieu A une telle décomposition avec B = A + C = C — (—A) 2 0; 
de 1A notre assertion s’ensuit immédiatement. 

En résumé, on a le théoréme suivant. 

THtoREME 16. Toute opération linéaire A de signe quelconque peut étre écrite, 
dune seule maniére, comme différence de deux opérations linéaires positives et 
disjointes Vune a Vautre, savoir 


A = pos A — neg A; pos A = sup (A, 0), neg A = sup (—A, 0). 
Pour toute autre décomposition A = B — C,B = 0,C = Oona 
B 2 pos A, C 2 neg A. 


i 
| 
; 
3 
> 
| 


190 FREDERIC RIESZ 


En ce qui concerne la plus grande minorante et la plus petite majorante d’un 
ensemble {A}, on peut les construire par le méme procédé que dans le cas des 
opérations positives et il est aussi manifeste que les théorémes 1-4 restent 
valables, avec la seule modification que les ensembles 4 une infinité d’éléments 
ne sont pas nécessairement minorables. 

On pourra aussi étendre au cas envisagé la notion des ontnedian disjointes et 
cela en convenant de considérer comme disjointes les opérations A et B et 
d’écrire A || B lorsque l’on a simultanément 


pos A ||pos B, posA||negB, negA||posB, negA||negB 


ou plus briévement, avec l’écriture 
pos A + neg A = mod A, 


| toujours que 
' mod A || mod B. 


a On voit aisément que les théorémes 6-8 concernant les opérations disjointes 
ieee restent valables sous ces conditions plus générales. En effet, quant au théoréme 
a 6, le passage du cas des opérations positives au cas général se fait immédiate- 
ment l’aide des relations 


pos (sup {A}) = sup {pos A}; _ neg (inf {A}) = sup {neg A} 
qui 4 leur tour sont des conséquences des inégalités évidentes 
pos A = pos (sup {A}); sup {A} S sup {pos A} 2 0 


et de celles qui en ressortent lorsqu’on passe 4 l’ensemble {—A}. L’extension 
du théoréme 7 se déduit des inégalités évidentes 


(2) pos Ar S pos Ai; neg 2) Ar neg 


Enfin, pour étendre le théoréme 8, on n’a qu’a l’appliquer aux opérations 
pos A + neg Bet neg A + pos B au lieu de A et B et puis retrancher neg A + neg B 
de chaque terme de la formule qu’on obtient. 

Observons en passant que pour des opérations A, disjointes entre elles, c’est 
toujours le signe d’égalité qui est valable dans les formules (2). Cela vient en 
écrivant >» A, = >, pos A, —, >, neg A; et en observant que, par l’hypothése 
faite et par le théoréme 7 les deux termes du second meen représentent des 
opérations linéaires positives et disjointes. 

Pour étendre la notion de partie disjointe, convenons de dire que A est une 
partie disjointe de B et d’écrire A < B lorsque A || B — A, tout comme pour 
des opérations positives. On voit aisément que, avec la convention posée, les 
théorémes 10 et 12 subsistent. 

Hl Quant aux familles complétes et presque complétes, réservons ces dénomins- 
tions pour les familles d’opérations positives et appelons prolongement de la 
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famille ce qui en vient en y ajoutant les différences de ses éléments. II est 
manifeste qu’on obtient déja toutes ces différences, chacune en un seul exem- 
plaire, quand on se contente de former les différences 4 termes disjoints. On 
voit aussi immédiatement qu’une famille et son prolongement embrassent, avec 
A et B, leurs combinaisons linéaires aA + bB ov a et b sont des facteurs numé- 
riques de signe quelconque, ainsi que les opérations inf (A, B) et sup (A, B). 
Ce dernier fait résulte par exemple des formules 


inf (A,B) =inf (A +C,B+C)—C, sup (A, B) = sup(A+C,B+C)—C 


en posant C = neg A + neg B et en rappelant que le fait analogue, pour les 
opérations positives A + C, B + C, est impliqué dans la définition des familles 
complétes ou presque complétes. 


VII. Décomposition simultanée des opérations linéaires 


Le premier pas vers un calcul fonctionnel des opérations linéaires est la 
décomposition simultanée de deux opérations linéaires A et E dont la seconde 
est positive; cette opération H et certaines de ses parties disjointes auront 4 
jouer dans la suite le rdle d’une sorte d’unités. Nous supposons encore que 
l’opération A appartienne a la famille [HZ] de l’opération E ou bien, lorsque 
l’opération A n’est pas positive, au prolongement de cette famille ou ce qui 
revient au méme que toute opération disjointe 4 E le soit aussi par rapport a A. 

Soit \ une variable réelle et envisageons les opérations 


P, = pos (AE — A), 
Ny = neg (AE — A) = E+ A 


en fonction de A.. Elles sont positives et monotones, la premiére croissant avec 
et la seconde décroissant, c’est-a-dire que, pour \; < Az, on a Py, S Py, et 
Ny, 2 Ny,. Je dis que P, et Ny sont des fonctions convexes; cela veut dire 


que les opérations 


Prin — Pry — Nr 
h h 

sont des fonctions croissantes de \. Notre assertion est évidemment équiva- 
lente A celle que pour tout élément f, P,f et Naf sont des fonctions numériques 
convexes. Or, pour démontrer cette derniére assertion, nous n’avons qu’d 
observer que la fonction P,f est précisément l’enveloppe supérieure des parties 
positives des fonctions numériques linéaires \Eg — Ag, formées avec tous les 
g Sf, c’est-d-dire pour toutes les décompositions f = g + (f — g). Or toutes 
ces parties positives sont des fonctions convexes et par conséquent il en est de 
méme pour leur enveloppe supérieure. La convexité des fonctions Nf s’en 
déduit par la formule N, = P, — »E + A. 

Les fonctions Pyf et Nyf étant convexes, elles admettent partout des dérivées 
des deux cétés. Envisageons seulement les dérivées 4 droite; comme fonctions 
de X, elles sont croissantes et continues A droite. Comme fonctions de f, elles 
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définissent, pour chaque valeur de , deux opérations linéaires que nous dé- 
signerons par P; et Ny. Celles-ci sont évidemment des fonctions croissantes 
de \ et comme P, est croissante et Ny, décroissante, alors Px = 0, Nx < 0. 

Les opérations P, et N) sont disjointes de par le théoréme 16, P, || N,. 
Nous allons montrer que l’on a aussi P, || Nx, Px || Nx, Px || Nx. D’abord, 
N, étant positive et décroissante, on a 


SM (h > 0), 
done P || Ny entraine que P) || Ny — Ny. et par conséquent 


N,—N 
P, | 
de 1a il suit que P, || —N x ou alors P, || Nx, le passage a la limite étant assuré 
par le théoréme 7. Soit de plus \ < yu, donc \ + h S uw pour h suffisemment 
petit; alors comme P,,, || Naz, 2 Ny 2 0 et comme P) est positive, croissante 
et convexe, il vient que 
— P. 
ce qui donne que Px || N, ; de la on passe & Py || Ny en se servant du théoréme 7. 
Enfin un raisonnement analogue fournit 
Px | N ie N MA 
h 
done aussi Px || — Nx et alors Px || Ny. 


Cela étant, différentions la relation P, — N, = XE — A; on obtient que 
Px — Nx = E, donc on pourra poser, par définition, 


(1) E, = Py, E-K = 


et les opérations E, et E — E, sont des parties disjointes de E. La décomposition 
correspondante de Vopération A en parties disjointes est fournie par 


(2) A. =)\P,-P,, 


En effet, la somme des seconds membres donne A; on le vérifie en rappelant 
les relations P, — N, = XE — A, Px — Nx = E. Le fait que A, || A — A 
s’obtient par le théoréme 6 des relations entre P,, Ny et leurs dérivées que 
nous venons de démontrer. Par les mémes raisons on a aussi Ey || A — An 
et Ay || E — &. 

Etablissons encore une inégalité relative aux opérations E, , A, et impor- 
tante pour la suite. L’opération P, étant une fonction convexe de X, on 4, 
pour A < up, 

(u — = Py — Py = (u — »)Px. 
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En ajoutant \Px — uP, & chaque membre et en changeant les signes, nous 
obtenons 


MPa — Px) S — Px — + Pr — Pr) 
ou avec les notations (1) et (2) 
(3) — By) S Ay — Ar S w(E, — B). 


Calculons encore les limites de HZ, et A, pour d infini, positif et négatif. Pour 
0<vA> 


E> = Py = (Py — Po)/A 2 AE A 
ce quidonne E, = E. Deméme, pour0 > \— —~,ona 
E>E-—E,= AE -A+M)/A CE 


et par conséquent H_. = 0. De plus comme Ny 2 0 et Nx < 0, Vopération 
A — Ay = —ANx + Nj est positive pour } > 0 et sous la méme hypothése, 
d’aprés l’inégalité (3), A, croft avec 4. Done pour \ — ~, A) tend vers une 
opération limite A,,. Comme de plus FE, || A — A, 2 A — A,,, done Ky || A — 
A,, et par le théoréme 7 il en ressort que H = E,, = sup Fy, ||A — A,. Un 
raisonnement analogue donne |’existence de A_.. et la relation E || A_.. Or, 
par hypothése, A appartient 4 la famille [Z] de E ou 4 son prolongement et on 
voit successivement qu’il en est de méme pour P, , Ny , Px , Nx, done aussi pour 
A) et enfin pour ses limites A, et A_.. Ce fait et les relations FE || A — A,, 
E || A_.. que nous venons de prouver, ne sont compatibles que si A — A,, = 0, 
A_. = 0. 

En résumé, ona E,, = E, A,, = A, E-.j = A-» = 0. 

Enfin considérons les opérations 


Ao = —Po = — sup (—A, 0) = — neg A, 
A — Ao = pos A. 


L’opération Zo ou plutét la décomposition E = Ey + (HE — Ep) se distingue, 
parmi les décompositions de E en parties disjointes, par chacun des deux faits 
suivants. po est la plus grande partie disjointe de E qui est disjointe a l’opéra- 
tion pos A et EH — E, est la plus grande partie de FE appartenant a la famille de 
pos A. Comme nous savons déjA que Ey || A — Ao = pos A, nos assertions 
découlent du théoréme 14, aprés avoir montré que l’opération H — Ey appartient a 
la famille de pos A, c’est-A-dire qu’elle est disjointe A toute opération B disjointe a 
pos A. Or, pour une telle opération B, comme, par |’inégalité (3), pos A = 
A — Ay = A, — A, = (EB, — E,) pour 0 < d < gu, il vient que B || Z, — Ey 
et pour \ > 0, « — © il en découle, par le théoréme 7, que B || Z,, — Eo = 
E — Ey ce qu’il fallait démontrer. 

Du cas \ = 0 que nous venons d’envisager, on passe au cas général en rem- 
plagant \ par \ — Xo et A par A — Ao# et l’on obtient la caractérisation suivante 
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de la décomposition E = E,, + (EZ — E,,). Les opérations Ey, et E — Ey, sont 
les plus grandes parties disjointes de E, la premiére parmi celles qui sont disjointes ¢ 
pos (A — oF) et la seconde parmi celles comprises dans la famille de cette derniére. 
Il vient en particulier que E, = 0 pour les valeurs négatives de \ toujours quand 
l’opération A est positive et plus généralement que E, = 0 pour d < a lorsque 
A 2aE. Unautre cas particulier intéressant se présente quand A est une partie 
disjointe de £; un calcul facile donne que dans ce cas E, = 0, E — A, E suivant 
ques <0,0 SA <1,A 2 let A, = 0 pourdA < let A, = A pour) 2 1, 

Nous aurons encore besoin dans la suite d’une décomposition a deux para- 
métres de E, induite par deux opérations A et B. Soit B une opération linéaire, 
appartenant, tout comme A, a la famille de l’opération EF ou 4 son prolongement 
et faisons la décomposition simultanée de EH et B comme nous venons dela faire 
pour E et A. Pour distinguer, nous désignons par yu le paramétre qui intervient 
et par E, la décomposition correspondante de E. Envisageons l’opération 


(4) inf (Ey E,) 


comme fonction des deux variables \ et » et indiquons la propriété caractéristique 
qui distingue l’opération F),, parmi les parties disjointes de l’opération £. 
Elle suit immédiatement de nos résultats. L’opération F,,, est la plus grande 
parmi celles des parties disjointes de E qui sont disjointes 4 la fois aux opérations 


~C = pos (A — XE) et D = pos (B — uE) et par conséquent, E — Fy ,y est la 


plus grande partie disjointe de E comprise dans la plus petite famille compléte 
embrassant les opérations C et D. 

Notons encore que l’on a, pour — © <2 S ©, — © Sm < 
la relation 


que l’on déduit du théoréme 9 en passant d’abord, sous l’hypothése <A; || A:, 3 


B, || Bs, & la formule 
inf (Ai + Ae, Bi + Be) = inf (Ai, Bi + Be) + inf (Ai + Az, Bi) 
+ inf (Az, Bz) — inf (A1, B) 


et puis eny remplagant Az, B, et By respectivement par Fy, , Ay, — fr, 
E,, et E,, — £#,,. 


VIII. Calcul fonctionnel des opérations linéaires 


Envisageons comme au chapitre précédent une opération linéaire positive F 
et une opération linéaire A, de signe quelconque, appartenant A la famille de £ 
ou & son prolongement et soit E, la décomposition de E que nous venons de 
considérer. Supposons que la fonction ®(A) soit intégrable par rapport 4 fi, 
ce qui veut dire que la fonction ® est intégrable, au sens de Stieltjes-Lebesgue, 
pour tous les f, par rapport aux fonctions E,f , fonctions numériques non-dé- 
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croissantes et continues & droite de la variable \.° Lorsque la fonction ® est 
non-négative, il est manifeste qu’en faisant correspondre 4 chaque élément f la 
valeur fournie par l’intégrale respective, on aura défini une opération linéaire 
positive que l’on désignera par @g(A), pour mettre ainsi en évidence la dépen- 
dance de la nouvelle opération de EZ, A et de la fonction #(\); on écrira aussi, avec 
une notation évidente, 


Lorsque la fonction @ est de signe quelconque, |’opération définie de la méme 
maniére est encore une opération linéaire, étant évidemment égale a la différence 
des deux opérations positives qui correspondent aux parties positive et négative 
de la fonction ®. 

Une clef 4 l’analyse fonctionnelle des opérations du type ,(A) est fournie par 
le théoréme suivant. 

17. Les fonctions non-négatives et étant intégrables par 
rapport et de plus telles que = 0, les opérations et 
seront disjointes Vune a Vautre. 

Pour le démontrer, commengons par le cas particulier ot 4; et 2 sont les fonc- 
tions caractéristiques de deux ensembles e; et ¢2 sans point commun et supposons 
d’abord que |’ensemble e; soit la réunion d’un nombre fini ou d’une infinité 
dénombrable d’intervalles demiouverts \; < 4 < d; n’empiétant pas. Alors on a 


(2) = = (Ey; — 
et 


Or de Ey; < E, Ky; < E et Ey, S By; il vient immédiatement, par les 
théorémes 11 et 10, que Ey; — Ey, < E, c’est-a-dire que Ey; — EB; || E — (Ey; — 
F,,). Done par la formule (3) on a aussi Ey; — Ey, || &2 et de 1a il s’ensuit, par 
la formule (2) et par les théorémes 6 et 7, que #: || #2, ce qu’il fallait démontrer. 

Passons au cas ot l’ensemble e; n’est plus supposé d’étre composé d’intervalles ; 
alors en fixant l’élément f et en choisissant un « > 0, on pourra, comme on sait 
bien, enfermer e; dans un ensemble e; du type particulier que nous venons de 
considérer et cela de sorte que les mesures des deux ensembles, par rapport a 
ne différent que de au plus. C’est-d-dire que, en désignant par #3(A) 
la fonction caractéristique de l’ensemble es , la valeur Df de l’opération positive 
D = %3,(A) — (A), pour l’élément f, est supposée de ne pas dépasser la quan- 
tité «/2. Posons encore = — et soit la fonction caractéristique 
de ensemble e. Alors il vient par le résultat particulier que nous venons 


‘H. Lebesgue, ‘Sur Vintégrale de Stieltjes et sur les opérations linéaires,”” Comptes 
Rendus Acad. Se. Paris, 160 (1910), pp. 86-88. 
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d’établir, que ®; || 4 et par conséquent, d’aprés le critére formulé au commence- 
ment du chapitre III, il existe une décomposition f = fi + fo de sorte que 


S fe S &/2. 


Comme de plus Dfz Df S «/2 et S Bo(A) S + - 
,(A)), il s’ensuit que 


if, < «/2, Pofo + S € 


et d’aprés le critére cité cela implique que 4 || ®2. 

Passons au cas de deux fonctions bornées ®; et $2. Or il est manifeste qu’on 
ne restreindra pas la généralité en supposant que les fonctions soient inférieures 3 
Vunité et par conséquent, dominées respectivement par les fonctions caractéris- 
tiques WV; et V2 des ensembles e; et dans lesquels respectivement 0, # 0), 
Il s’ensuit que l’on a aussi ®; WV, , We pour les opérations respectives. 
Comme de plus les ensembles ¢; , ¢2 sont sans point commun, les deux opérations 
WV, et V2 sont disjointes d’aprés ce que nous venons d’établir et 4 plus forte raison 
il en sera de méme pour les opérations ®; et ®, . 

Enfin le cas général ot les fonctions ne sont plus supposées étre bornées, se 
raméne au cas précédant en tronquant d’abord les fonctions au niveau n et puis 
faisant passer n a l’infini ou ce qui revient au méme, en appliquant le théoréme 
7 aux opérations qui correspondent aux fonctions tronquées, ce qui achéve la 
démonstration. 

Voici un corollaire important de notre théoréme concernant les parties positive 
et négative d’1ne opération © correspondant 4 une fonction (A) de signe quel- 
conque. Les fonctions = pos (A) et = neg parties positive et 
négative de notre fonction, étant telles que le produit 4,4, s’annule partout, il 
vient que 4 || 2 ; comme d’autre part on a® = 4, — 42, pour les opérations de 
méme que pour les fonctions, il s’ensuit par le théoréme 16 que les opérations 
®, et &, ne sont que les parties positive et négative de l’opération. C’est-a-dire 
que les parties positive et négative de Vopération &g(A) sont précisément les deux 
opérations qui correspondent aux parties positive et négative de la fonction ®(d). 

Observons en passant que le théoréme 17 reste valable pour des fonctions de 
signe queleonque; on n’a en effet qu’A reprendre le dernier énoncé en se rappelant 
ce qu’il faut entendre par opérations disjointes quand il s’agit des opérations de 
signe quelconque. 

Le corollaire que nous venons d’énoncer suggére la généralisation suivante, 
embrassant 4 son tour le théoréme 17. 

18. Ftant données un nombre fini de fonctions intégrables par 
rapport soient respectivement ®(d) et leurs enveloppes inférieure et 
supérieure; alors on a 


@,(A) = inf  Be(A) = sup 


Ces faits subsistent pour une infinité dénombrable de fonctions ®,(d), sous la 
condition que Venveloppe en question soit intégrable par rapport d E, , condition 
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qui évidemment est toujours remplie pour des fonctions non-négatives ;(d) et pour 
leur enveloppe inférteure, ou aussi sous la condition que la plus grande minorante 
ou la plus petite majorante en question existe. 

En effet, pour deux fonctions 9; , #2 , le théoréme suit immédiatement de notre 
corollaire en rappelant que 


min (®:(A), 2(A)) = — pos (@1(A) — 42(d)), 
max (#:(A), 2(A)) = S2(A) + pos (@:(A) — &2(d)). 


Or le cas de n fonctions se raméne par récurrence & ce cas particulier et le cas 
d’une infinité dénombrable en ressort par le théoréme permettant d’intervertir, 
au cas des suites monotones, l’ordre de l’intégration et du passage 4 la limite. 

Le théoréme 18 nous fournit les moyens pour étudier dans toute leur généralité 
les liens qui existent entre les fonctions numériques composées et les fonctions 
symboliques d’opérations qui y correspondent. Avant d’y procéder, il nous faut 
encore calculer les opérations qui correspondent aux fonctions ®(A) = 1 et &(A) = 
\. Je dis que ce ne sont que les opérations HE et A. Quant a la premiére, notre 
assertion vient immédiatement de ce que E,, = E, E_. = 0. Quant Ala seconde, 
soit ¢ une quantité positive arbitrairement choisie et écrivons l’intégrale (1), 
avec (A) = A, sous la forme d’une série infinie dans les deux directions 


(i+])e 
6=)> 


écrivons de méme, avec les notations du chapitre précédent, 
A= (Agi. — Ave). 


Or, les termes de rang 7 des deux séries sont compris entre 7e(E(ii1. — Eie) et 
(i + 1l)e(Eusn. — Ee), ce qui est évident pour la premiére et s’ensuit pour la 
seconde des inégalités (3) du chapitre précédent. De 1a il vient immédiatement 
que 


et par conséquent on a précisément @ = A. 

Cela étant, nous procédons A démontrer le théoréme suivant. 

THfOREME 19. Soit B = &,(A) et soit E, la décomposition de E qui correspond 
a Vopération B. Supposons que la fonction Y(u) soit intégrable par rapport a 
E, et posons Y(®(A)) = T(A). Alors cette derniére fonction est intégrable par 
rapport et Von a 

T'(A) = W,(B) = Wz(z(A)). 

Briévement, la composition des fonctions symboliques du type ®,(A) répond a 

celle des fonctions numériques. Pour le voir, commengons par observer que si 


notre assertion est valable pour une fonction ¥Y, il en sera de méme pour la partie 
positive de cette fonction, ce qui vient immédiatement par le corollaire du 
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théoréme 17. Or nous venons de voir que notre assertion est vrai pour V(u) = | 
et pour V(u) = yu; donc elle est vraie pour toute fonction linéaire de yu et pour les 
parties positives de telles fonctions et enfin pour les combinaisons linéaires des 
fonctions considérées, c’est-d-dire pour toutes les fonctions continues et com- 
posées d’un nombre fini de traits linéaires. L’assertion reste aussi valable pour 
les limites des suites monotones, croissantes ou décroissantes, formées de fone- 
tions du type considéré; cela vient par le théoréme permettant d’intervertir 
ordre de l’intégration et du passage 4 la limite. De plus quand on n’envisage 
que des suites bornées, on est assuré d’avance que les fonctions limites sont 
intégrables par rapport 4 E,. Ces fonctions limites pour lesquelles on vient de 
vérifier le théoréme, ce sont précisément toutes les fonctions bornées continues 
ou semicontinues; on en passe 4 toutes les fonctions V bornées et intégrables par 
rapport 4 E, en fixant, outre la fonction V, un élément arbitraire f et une quantité 
positive ¢ et en rappelant que la fonction Y peut étre enfermée entre deux fonc- 
tions bornées et semicontinues WV; et Ve, Vi(u) S V(u) S We(u) et cela de sorte 
que la différence de l’intégrale de ces deux fonctions par rapport 4 EZ, soit infé- 
rieure i «. De 14 il vient immédiatement que 


| We(B)f — Te(A)f| < 


et comme f et € sont arbitraires, on conclut que l’assertion du théoréme subsiste 
pour toutes les fonctions ¥ envisagées. On étend la validité de l’assertion aux 
fonctions positives non bornées en les tronquant d’abord au niveau n, puis faisant 
n aller 4 l’infini; enfin, on passe aux fonctions non bornées et de signe quelconque 
en considérant 4 part leurs parties positive et négative, ce qui achéve la dé- 
monstration. 

Toutes nos considérations restent valables, avec des modifications plus ou 
moins évidentes, pour les fonctions de plusieurs variables. Envisageons, pour 
fixer les idées, le cas de deux variables. Alors on écrira, 4 l’analogue de la 
formule (1), 


ou E,,, est la décomposition 4 deux paramétres, envisagée A la fin du chapitre 
précédent, de l’opération EZ. La formule (4) veut indiquer que |’opération 
,(A, B) se calcule, pour tout élément f, en intégrant la fonction ®(A, u) par 
rapport A la fonction E,,f, calcul qui se fait comme dans le cas d’une seule 
variable; on n’aura qu’d remplacer les différences du type Ey, — Ey, par des 
différences itérées de la forme 


Or il vient de la formule (5) du chapitre précédent et des relations E,, — Ey, < £, 


E,, — E,, < E, par le théoréme 11, que ces différences itérées sont des parties 
disjointes de l’opération E; cela étant, le théoréme 17, avec sa démonstration et 
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son corollaire, ainsi que le théoréme 18, s’étendent immédiatement au cas de 
deux variables. 

Le cas de plusieurs variables s’achéve d’une maniére analogue. 

Quant au théoréme 19 concernant les fonctions composées, on le généralise 
peat le cas de n fonctions ;, --- ,®, dépendant d’une ou de plusieurs variables 
Am et une fonction , , Mn) dans laquelle on aura A substituer aux 
variables u; les fonctions ®;. Pour fixer les idées, soit m = 1, n = 2, ce cas 
montrant nettement les modifications nécessaires dans le cas général. Nous 
aurons done & considérer, outre les opérations E, A, E, , les opérations B = 
6,,(A), C = qui correspondent aux fonctions et $2(A) et l’opération 
v,(B, C) correspondant a la fonction Vu, v) et que l’on définit en intégrant la 
fonction par rapport E,,, = inf E,), les décompositions E, et étant 
celles qui correspondent respectivement aux opérations Bet C. Alors le théo- 
réme 19 reste valable, les modifications qu’il faut apporter 4 |’énoncé n’étant en 
substance que d’écrire ®, , 2 au lieu de ®; B, C au lieu de B et yu, v au lieu de ». 
Quant 4 la démonstration, la seule modification essentielle consiste en ce que, 
au lieu des sommes formées de fonctions linéaires et des parties positives de telles 
fonctions, il est préférable d’envisager des combinaisons linéaires des fonctions 
que l’on construit en partant d’un nombre fini de fonctions linéaires, homogénes 
ou non, de variables y, v et en passant, dans un certain ordre, 4 des enveloppes 
inférieures et supérieures. En effet, envisageons dans le plan un réseau tri- 
angulaire et formons, pour un des noeuds et pour chacun des triangles adjacents, 
les fonctions linéaires égales 4 un au noeud envisagé et s’annulant sur le cété 
opposé du triangle respectif. Puis, formons la partie positive de l’enveloppe 
inférieure de ces fonctions ou ce qui revient au méme, |’enveloppe supérieure de 
leur enveloppe inférieure et de 0. Cela étant, envisageons une fonction continue 
qui est linéaire dans chacun des triangles du réseau et constante dans les triangles 
éloignés; alors il est manifeste qu’une telle fonction pourra étre écrite toujours, & 
une constante additive prés, sous forme d’une combinaison linéaire des fonctions 
particuliéres que nous venons de former. En observant encore que l’évaluation 


des intégrales 


par intégration successive, exécutée pour les deux derniéres respectivement dans 
lordre et dans l’ordre u, v, donne C) = E, B, C respectivement pour 
V(u,¥) = 1, yw, », il vient que le théoréme 19 est valable pour toutes les fonctions 
que nous venons d’envisager. En faisant varier notre réseau, par exemple en le 
subdivisant indéfiniment d’une maniére convenable, nous disposerons d’une 
classe de fonctions dont les suites monotones suffisent largement pour approcher 
indéfiniment toutes les fonctions continues ou semicontinues et bornées des 
variables u, v; cela étant, la démonstration s’achéve comme pour les fonctions 
d’une seule 
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A titre d’exemple, disons quelques mots du produit des opérations A et B, 
formé par rapport A Vopération £ ct que nous désignerons par (AB); les opéra- 
tions A ct B sont supposées d’appartenir 4 la famille de # ou 4 son prolongement, 
Nous entendons par 1a l’opération &g(A, B) qui correspond au produit (A, u) = 
hu. Comme l’intégrale de cette fonction non bornée par rapport aux fonctions 
Ey,,f ne converge pas nécessairement dans tous les cas, le symbole (AB), n’a 
pas toujours de sens; mais on voit immédiatement que le produit existe, entre 
autres, lorsque l’une des opérations A, B est bornée par rapport 4 E, c’est-a-dire 
qu’elle est comprise entre des multiples numériques de £. 

La multiplication que nous venons d’introduire est évidemment commutative; 
les autres régles de calcul, moins évidentes, sont des corollaires du théoréme 19. 
Tels sont la loi distributive (A(B + C))z = (AB)z + (AC)z, valable sous 
l’hypothése que deux des trois produits impliqués aient un sens et la loi associa- 
tive ((AB)sC)z = (A(BC)z)« , soumise a l’hypothése que les produits (AB), , 
(BC), et ’un au moins des deux produits itérés existent. 

Dans mon mémoire original ot j’ai étudié ces produits d’opérations d’un> 
fagon détaillée, je les ai introduits sans faire appel 4 l’intégrale (4). Dans le cas 
ot. A et B sont des opérations positives, cas auquel on reméne sans peine le cas 
général, ma définition d’autrefois était la suivante. Le produit (AB)¢ se définit 
comme la plus petite majorante de toutes les opérations du type ab’ ot FE’ 
parcourt les parties disjointes de l’opération E et les facteurs numériques a = 0, 
b 2 0 varient sous les conditions ak’ S A et bE’ < B. Je laisse au lecteur de 
vérifier |’équivalence des deux définitions. 

Voici un autre exemple, plus intéressant par des raisons que nous allons voir. 
Supposons que A = 0, B = 0 et considérons l’opération (A'B’), qui correspond a 
la fonction d*u', envisagée pour \ = 0, u = 0; il est clair que sous l’hypothése 
faite, les valeurs négatives des variables ne comptent pas, c’est-A-dire que dans 
la définition (4) de @z(A, x), il est permis de remplacer par zéro la limite inférieure 
d’intégration, nos théorémes restant applicables aprés cette modification. Cela 
étant, je dis que la “moyenne géométrique” (A*B')g ou briévement A'B? ne 
dépend pas du choix particulier de Vopération E. En effet, la fonction Nu! n’est 
que l’enveloppe inférieure des fonctions (chk + u/c)/2 parcourt une suite 
partout dense de valeurs positives, par exemple celle des nombres positifs ra- 
tionnels. II s’ensuit par le théoréme 18 que l’opération A‘B? n’est que la plus 
grande minorante des opérations (cA + B/c)/2 od ¢ parcourt les nombres 
positifs rationnels ou, ce qui revient au méme, toutes les valeurs positives, 
c’est-’-dire que notre “moyenne géométrique” peut étre construite sans faire 
appel 4 l’opération Z. D’ailleurs, en rappelant la démonstration du théoréme 1, 
on aboutit aisément a la construction suivante. La valeur de A’B’ pour un 
élément donné f est la borne inférieure de l’ensemble des valeurs que prend 
l’expression 


pour tous les n et pour toutes les décompositions f = fi + --- +f, de l’élément f. 
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(‘ce dernier exemple et d’autres plus généraux, comme A “B® avec a, B > 0, 
a+6= lou A*B°C’ avec a, B, y > 0,a + 8+ y = 1, nous suggeérent d’envi- 
sager 'hypothése générale sous laquelle opération correspondant a la fonction 
est indépendante du choix particulier de l’opération E. Pour fixer les idées, 
nous envisageons les fonctions de deux variables ect cela en observant que le 
cas des fonctions d’une seule variable est dépourvu d’intérét, n’embrassant que 
le cas des fonctions de la forme ak + b| |. Nous allons montrer que l’opéra- 
tion Py est indépendante de E toujours que la fonction ®(d, u) est positivement 
homogéne du premier ordre, c’est-d-dire que, pour tout c 2 0, 


(5) ch) cP(X, 


A cet effet, observons qu’une large classe de telles fonctions s’obtient en partant 
des fonctions linéaires homogénes ou ce qui revient au méme, des variables X, 
u elles-mémes et en formant alternativement, de toutes les maniéres possibles, 
des enveloppes et des combinaisons linéaires. On arrive aux méme fonctions en 
partageant le plan en un nombre fini de régions angulaires issues de |’origine et en 
considérant les fonctions continues dans le plan entier et égales, dans chacune 
de ces régions, 4 une fonction linéaire homogéne dont l’expression varie de 
région 4 région. Or on voit immédiatement que ces fonctions suffisent pour 
approcher indéfiniment, par des suites monotones, toutes les fonctions semi- 
continues ® du type (5) et en particulier celles qui sont inférieures en module 4 
un multiple numérique de la fonction | \| + ||. Done pour toutes ces fone- 
tions notre assertion se vérifie par le théoréme 18, énoncé pour des fonctions de 
deux variables. C’est-d-dire que pour toutes ces fonctions l’opération ®,(A, B) 
existe, au sens de la formule (4), quelle que soil l’opération linéaire positive E, 
sous la seule hypothése que A et B appartiennent 4 la famille de E ou a son 
prolongement et de plus, la fonction étant fixée, l’opération B) 
est la méme pour toutes les ZH. Soient maintenant E et E deux queleconques 
parmi les opérations E envisagées, soient Fy,, et £y,, leurs décompositions par 
rapport 4 A et B et supposons que la fonction ®(A, «), du type (5), soit intégrable 
par rapport 4 la décomposition £,,, ; en attendant, nous supposons encore que 
/®(\, | reste inférieur & un multiple + |u|. Soient de plus une 
quantité positive et f un élément arbitrairement choisis; alors en raisonnant 
comme au cours de la démonstration du théoréme 19, nous enfermons ® entre 
deux de nos fonctions semicontinues soit ®; , ott S w) S 
choisies de sorte que les intégrales de ces deux fonctions par rapport 4 Fy,,f ne 
différent que de ¢ au plus. D’aprés ce que nous venons de voir, la valeur de ces 
intégrales n’est pas changée quand on remplace E par £; comme de plus f et ¢ 
étaient arbitraires, il vient que (A, B) = ®2(A, B). 

Enfin, pour les autres fonctions @ positivement homogénes et intégrables par 
rapport la décomposition E,,, , la démonstration s’achéve en considérant 
part les parties positives et négatives de ces fonctions et en les tronquant au 
mveau n (|X| + | w |), e’est-A-dire en formant, par exemple, l’enveloppe infé- 
neure de pos ® et de n(| \| + | « |), et en faisant passer n A l’infini. 

Pour illustrer ces considérations, envisageons l’exemple 1 du chapitre I, 
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c’est-a-dire le domaine formé par les fonctions continues f(x) définies dans 
Vintervalle a S$ z S b. On sait que les opérations linéaires de ce domaine Q 


peuvent étre écrites sous la forme d’une intégrale de Stieltjes 


6) Af = dal), 


ott a(x), la “génératrice” de l’opération A, est une fonction 4 variation bornée 
que l’on peut supposer continue 4 droite 4 l’intérieur de l’intervalle (a, b) et 
s’annulant pour x = a, hypothéses sous lesquelles la fonction a(x) sera unique- 
ment déterminée par l’opération A.’ Inversement, chaque fonction A variation 
bornée donne lieu 4 une opération A. Les opérations positives sont précisément 
celles dont les génératrices sont des fonctions non-décroissantes. Soit EZ une 
telle opération et soit ¢«(x) sa génératrice; quant 4 l’opération A, supposons 
d’abord qu’elle soit positive et bornée par rapport 4 #, 0 S A S&S cE; alors, 
grace 4 un théoréme de M. Lebesgue, on pourra écrire, au lieu de (6), 


(7) Af = / a(x) f(x) de(x) 


ot la fonction a(x), positive et bornée, n’est que la dérivée de a(x) par rapport a 
e(x), existant presque partout par rapport 4 cette derniére et ot l’intégrale au 
second membre est prise au sens de Stieltjes-Lebesgue. Dans le cas général 
ot l’opération A, de signe quelconque et bornée ou non par rapport 4 E£, est 
supposée d’appartenir 4 la famille de E ou 4 son prolongement, on n’aura qu’ 
considérer l’opération A, — Ao, positive et bornée par rapport 4 E conformé- 
ment 4 l’inégalité (3) du chapitre précédent, et allant en croissant respectivement 
vers pos A et neg A pourA — + ©. Donec on n’aura qu’a écrire |’opération 
A, — Apo sous la forme (7), avec une fonction a(x; d) au lieu de a(x) et faire les 
deux passages 4 la limite ; alors la formule (7) sera vérifiée dans le cas général, avec 


a(x) = lim [a(x;) — a(x; —d)]. 


Enfin, le fait que cette fonction a(x), intégrable par rapport & e(x), n’est que la 
dérivée de a(x) par rapport 4 ¢(x), vient en comparant les formules (6) et (7). 

Observons encore les faits évidents que l’opération ¢.A + cE appartient aussi 
au type envisagé et qu’elle s’écrit en remplagant, au second membre de (7), 
a(x) par ca(x) + ¢2 et que de plus si B est une opération du méme type (pour 
laquelle le multiplicateur a(x) est 4 remplacer par b(z)), il en est de méme pour 
les combinaisons linéaires de A, B et E et les multiplicateurs respectifs sont 
fournis par les combinaisons analogues des fonctions a(x), b(x) et 1. De plus 
les opérations inf (A, B) et sup (A, B) appartenant a la famille de EZ ou 4 son 
prolongement, sont nécessairement du type envisagé et il vient immédiatement 
que les multiplicateurs respectifs ne sont que les enveloppes, inférieure et supé 


’ F. Riesz, “Sur les opérations fonctionnelles linéaires,’’ Comptes Rendus Acad. Sc. Paris, 
149 (1909), pp. 947-977. 
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rieure, des fonctions a(x), b(x). Il ressort de ces remarques, d’abord pour les 
fonctions ®(A) et (A, #) continues et composées d’un nombre fini de traits 
linéaires, que les multiplicateurs qui correspondent aux opérations ,(A) et 
,(A, B) ne sont que les fonctions composées ®(a(x)) et &(a(x), b(x)), et on 
démontre aisément, en raisonnant comme nous venons de le faire 4 d’autres 
oceasions, que le méme fait subsiste pour toutes les fonctions  intégrables 
respectivement par rapport & ou 
Un cas particulier trés simple se présente lorsque E est l’intégration 


By = ae, 


done e(x) = x, a(x) = a’(x) et toutes les intégrales qui interviennent sont des 
intégrales de Lebesgue. Un autre cas particulier est celui ot e(x) est une fonction 
des sauts; dans ce cas les intégrales se réduisent & des sommes finies ou 4 des 
séries absolument convergentes. 


IX. L’extension du domaine fondamental 


Un des cas particuliers les plus importants d’une telle extension se présente 
quand, pour introduire l’intégrale de Lebesgue, on passe des fonctions continues 
ou encore de celles du type escalier aux fonctions sommables, passage que |’on 
peut effectuer de plusieurs maniéres, entre autres en se servant de suites bornées 
ou aussi de suites monotones de fonctions, convergeant partout ou seulement 
presque partout; on peut aussi envisager des suites qui, au lieu de converger au 
sens ordinaire, ne sont supposées que de converger en mesure ou en moyenne ou 
seulement d’une fagon faible par rapport a tel et tel exposant. Pour procéder 
d’une maniére analogue quand il s’agit des domaines abstraits, il faudrait com- 
mencer par fixer une notion de convergence pour les suites formées d’éléments du 
domaine envisagé. Une des définitions possibles, d’ordre général, est la suivante. 
La suite {f,} est dite convergente si les suites numériques {Af,} convergent 
quelle que soit l’opération linéaire A. Des définitions moins restrictives se 
posent en exigeant la convergence des suites {Af,} seulement pour une certaine 
famille d’opérations. Ainsi par exemple, quand il s’agit du domaine 2 formé 
par les fonctions continues et non négatives dans l’intervalle a < x S 5, la pre- 
miére hypothése revient A considérer comme convergentes celles des suites qui 
sont bornées et convergent partout, tandis que les définitions moins restrictives 
dont nous venons de parler, embrassent les suites bornées ou non qui convergent 
partout ainsi que certains types de convergence faible. 

Un principe d’extension plus systématique nous est suggéré par l’idée de 
dualité. Nous nous contentons d’esquisser quelques idées vagues qui s’y 
tattachent. Nous partons du fait que les opérations linéaires et positives d’un 
domaine 2 forment a leur tour un domaine 0* satisfaisant aux mémes hypo- 
théses, savoir aux hypothéses 1-4 posées au chapitre 1 et méme & l’hypothése 4’. 
Parmi les opérations linéaires A* de ce nouveau domaine fondamental se trou- 
vent, dans un sens A préciser, les éléments f du domaine 2 dont nous sommes 
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a 


partis. En effet, un tel élément f fait correspondre 4 chaque opération 4, 
c’est-A-dire A chaque élément A du domaine %, la valeur Af et l’opération aingj 
définie dans le domaine 2* est évidemment linéaire. Cela suggére de faire 
l’extension du domaine Q en y ajoutant toutes les autres opérations linéaires dy 
domaine 2*. Or considérons sous cet aspect notre exemple 1, le domaine 
formé par les fonctions f(x), positives et continues poura S x Sb. Nous savons 
déjA que, dans le cas envisagé, les opérations A sont de la forme 


Af = f(x) da) 


ott a(x) est une fonction non décroissante, d’ailleurs arbitraire et que de cette 
sorte, le domaine Q* pourra étre considéré, en substance, comme formé par ces 
fonctions a(x), toujours en convenant de regarder comme identiques deux 
fonctions a(x) lorsqu’elles donnent lieu 4 la méme opération A, ce qui revient 4 
ce que les deux fonctions ne différent que d’une constante, sauf peut-étre en un 
ensemble dénombrable de points x, intérieurs 4 l’intervalle (a, b). Or parmi les 
opérations linéaires que l’on peut définir dans ce domanie 0*, se trouvent entre 
autres celles de la forme 
A*a = g(x)[a(x + 0) — a(x — 0)] 

ou g(x) désigne une fonction bornée, d’ailleurs queleonque, mesurable ou non. 
Ces opérations particuliéres n’agissent que sur la partie de a(x) connue sous le 
nom fonction des sauts; elles peuvent aussi étre écrites sous forme d’intégrale de 
Stieltjes-Lebesgue pour les fonctions a(x) qui coincident avec leurs fonctions 
des sauts, mais elles ne le peuvent pas pour les fonctions a(x) du type général. 
Tout cela montre que l’extension envisagée est trop large sous certain aspect, de 
sorte qu’il est préférable de se limiter, au lieu de toutes les opérations linéaires 
du domaine 2*, 4 des opérations A* qui, pour ainsi dire, sont assez proches de 
celles attachées aux éléments f du domaine 2. A cet effet, considérons la plus 
petite classe d’opérations A* embrassant celles attachées aux éléments f et 
jouissant des deux propriétés suivantes: 1) elle forme un module, c’est-i-dire 
que, avec A} et Ag, elle contient aussi la différence AY — A? ; 2) elle est com- 
pléte, c’est qu’elle comprend les plus petites majorantes de ses sous-ensembles 
majorables. L’existence d’une telle classe est immédiate; en effet, elle n’est 
que la partie commune de toutes les classes jouissant des propriétés exigées. Or 
envisageons une opération A* appartenant A la classe considérée et attachons A 
cette opération la fonction a*(x) définie pour chaque 2» par les formules 


a* (29) A*(Az); (20); 


on voit aisément que ces fonctions a*(x) donnent précisément |’ensemble des 
fonctions bornées et mesurables (B) dans l’intervalle (a, b) et que de plus l’opéra- 
tion A* s’exprime par l’intégrale de Stieltjes-Lebesgue 


| { 
| 
| 
| 
{ 
| 
| 
{ 


LA THEORIE GENERALE DES OPERATIONS LINEAIRES 205 


Appliquons ces considérations 4 l’exemple 7 dont l’exemple 1 n’est qu’un cas 
particulier et qui nous conduira a la théorie d’intégration due 4 M. Daniell. 
Sans restreindre la généralité, nous pouvons supposer que, tout comme dans 
exemple 1, le domaine 2 comprenne les multiples cf(x), c 2 0, de ses éléments 
f(c). En effet, s'il n’en était pas ainsi, on pourrait tout d’abord ajouter au 
domaine Q, d’une fagon évidente, les multiples ¢f(x) ot c est rationnel et il est 
aussi manifeste que l’hypothése 4’ subsiste aprés cette extension. De plus 
comme on aura défini l’opération A pour les nouveaux éléments cf en posant 
A(cf) = cAf, le domaine 0* ne sera pas changé. Une seconde extension, faisant 
entrer aussi les c irrationnels (et quelquefois, comme le montrent les exemples 
2 et 3, d’autres éléments nouveaux) se fera en ajoutant encore les limites des 
suites qui convergent d’une maniére relativement uniforme, comme les a définies 
E. H. Moore, c’est-A-dire des suites f,,(x) telles que, pour presque tous les m et n 


| — | S efo(x) 


ou ¢ est une quantité positive arbitrairement fixée et fo(x) un élément de Q 
attaché 4 la suite.’ On voit aisément que le domaine ainsi étendu remplit les 
hypothéses 1-4 et aussi 4’ et que les opérations A du domaine @ se prolongent au 
domaine étendu. 

Supposons done que le domaine 2 comprenne avec f(x) tous les multiples 
cf(z), c 2 0; alors nous voici au point de départ de la théorie de M. Daniell. 
Cela étant, il ne faut que suivre le méme ordre d’idées que dans le cas des fone- 
tions continues pour arriver au domaine 2*, ensemble des opérations A de Q, 
et au plus petit module complet d’opérations A* embrassant les opérations 
particuliéres qui correspondent aux éléments f(x) de Q et aux fonctions a*(zx) 
introduites comme plus haut. Mais il convient de noter que dans le cas de 
exemple 1 la fonction a*(x) attachée 4 l’opération A*, une fois construite, 
détermine l’opération A* conformément A la formule (1). Or il n’est pas ainsi 
dans le cas général; pour le voir, nous n’avons qu’A modifier l’exemple 1 en 
limitant la variable aux valeurs rationnelles de x, mais, en récompense, en exi- 
geant des fonctions f(x) d’étre uniformément continues. Ces conventions un 
peu artificielles semblent de ne rien changer et on ne se rend compte d’une 
différence essentielle entre les deux exemples que lorsqu’on arrive aux fonctions 
a*(x). Il est manifeste que ces fonctions ne déterminent plus les opérations A* 
pour tous les éléments A du domaine 9*, ni méme pour les A du type particulier 
A,f = f(x) sauf quand 2 est rationnel. 

Ce phénoméne explique pourquoi la théorie de M. Daniell était foreée 4 ne 
considérer comme intégrale que celles des opérations A qui satisfont encore a 
Vhypothése (L), savoir que Af, — 0 pour toute suite f,(x) décroissante et tendant 
partout vers zéro. Dans l’exemple 1, l’hypothése (L) est réalisée pour toute 
opération A, comme cela vient par le théoréme classique de Dini; on voit im- 
médiatement qu’il n’en est pas ainsi pour l’exemple modifié. 


: E. H. Moore, “On the Foundations of the Theory of Linear Integral Equations,”’ 
Bulletin American Math. Soc., 13 (1912), pp. 334-362. 
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Montrons que les opérations A jouissant de la propriété (L) forment une famille 
complete. I] est évident que ces opérations satisfont aux deux premiéres hypo- 
théses posées pour les familles complétes; avec A: , elles comprennent aussi toutes 
les A S A, et avec A; et Ao, leursomme A; + Az. Ce qu’il faut encore prouver, 
c’est que la plus petite majorante de tout sous-ensemble majorable appartient 
aussi 4 la classe considérée. C’est-a-dire que si Af, — 0 pour toute suite dé 
croissante f,(2) — 0 et pour tout élément d’un ensemble majorable {A}, on a 
aussi Af, — 0 pour A = sup {A}. 

Fixons une quantité ¢ > 0; alors, d’aprés la construction de sup {A}, on peut 
tirer de notre ensemble un nombre fini d’opérations, soit Ai, Az, --- , Am et 
faire une décomposition fi = fu + fiz + --- + fim de fagon que 


Arf 2 Afi — € 


on aura A,,f, = 


et par conséquent, en posant sup (A1, Ae, --- , Am) " 
0, fn fi, il vient 


Af, — ¢, done (A — A,,)fi S €; comme de plus A — A» 
que, pour tout n, 
(2) (A Am)fn Se. 


D’autre part on a 


An 
> 


Anfn 


done par (2), pourn— ~, 
lim Af, = lim (A — An)fn © 6 


e’est que Af, — 0, c. Q. F. D. 

Pour élucider complétement l’hypothése (L) au point de vue de notre théorie, 
il nous faudrait encore examiner si en nous bornant, au lieu de toutes les opéra- 
tions A, a celles qui satisfont 4 l’hypothése (L) et en nous servant des opérations 
A* de ce domaine réduit considéré comme domaine fondamental, |’extension 
ainsi faite du domaine 2 n’est pas plus large que celle basée sur le domaine 
Q* et si, d’autre part, aprés cette réduction, les opérations A* sont déterminées 
univoquement par les fonctions a*(x) qui leur correspondent. La réponse 4 
la derniére question sera en général négative, comme le montre déja l’exemple 1, 
et pour arriver 4 une extension intrinséque qui n’est pas excessivement large, 
le mieux sera, dans la plupart des cas, d’avoir recours au plus petit module com- 
plet embrassant les opérations A* qui correspondent aux fonctions f(x) dont nous 
sommes partis. 


SzEeGEp, Hungary. 
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CHARACTER GROUP OF A RELATIVE HOMOLOGY GROUP 
By W. FLEXNER 
(Received April 21, 1939) 


1. Introduction 


If L is a subcomplex of a finite complex K, the topologically invariant factor 
group of the group of p-cycles of ZL bounding in K taken with respect to the sub- 
group of those bounding in L is useful in topology. The principal aim of this 
paper is to find its character group and to interpret the character group geo- 
metrically (theorems 4.2 and 4.4 at the end). Incidentally some algebraic and 
topological side results are obtained which will, it is hoped, help in finding 
character groups of less simple topologically invariant groups. 

The set of elements of a group containing more than one element are turned 
into a Hausdorff space by definition of open sets which satisfy, besides the 
Hausdorff axioms, axioms connecting the group operation and open sets, for 
instance those given by van Kampen’ or Pontrjagin.” A neighborhood of an 
element of the group is an open set containing the element, and a nucleus is a 
neighborhood of the identity. The closure of a set F of a Hausdorff space will 
be written F. A Hausdorff space is bicompact if every covering of the whole 
space by open sets has a finite subset which covers the whole space (A.H. p. 86’). 
The space will be called discrete if every element is an open set. 

As all groups here considered are Abelian, the group operation will be written 
additively, a + b. If H is the group of continuous homomorphisms of the 
topological Abelian group G into (A.H. “‘in’’) the additive group R of the real 
numbers mod 1 with the usual topology, then H is called the character group 
of G. When G is discrete and H is topologized as in v.K. or P.B., H is bicompact. 
When G is bicompact, H is given discrete topology. 

If G and H are topological Abelian groups and to every g eG and h e H there 
is an element gh = hg of R such that gh is a continuous function of g and of h, 
g+g')h = gh + Q/hand g(h + h’) = gh + gh’, then G and H are said to be 
paired; each h defines a continuous homomorphism of G into R, and each g a 
continuous homomorphism of H into R. If G and H are paired and © is a 
subgroup of H, © = (G, §) is defined as the set of all elements g ¢ G such that 
gh = 0 for every he S. The set (G, H) is a subgroup of G. (P.B. Note that 
none of the results in this paper depend on the Weyl-Peter theorem, see P.G.*) 


‘van Kampen: Locally Bicompact Abelian Groups and their Character Groups; Annals of 
Math. 36 (1935) pp. 448-463. Referred to in the sequel as ‘‘v.K.’’. 
Topological Groups, (translation by E. Lehmer) Princeton, 1939. Referred 
oas “PB.” 

* Alexandroff-Hopf: Topologie, Berlin 1935. Referred to as ‘‘A.H.” 

* Pontrjagin: The Theory of Topological Commutative Groups: Annals of Math. (35) 
1934, pp. 361-388, referred to in the sequel as ““P.G.”’ 
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A. The necessary and sufficient condition that a discrete group G and a bi- 
compact group H be character groups of one another is (G, H) = (H, @) = 0. 
(P.B.; P.G. p. 369; v.K. pp. 445, 456) 

If & is a closed subgroup of G (i.e. a subgroup whose elements form a closed 
set) the factor group G — @ may be topologized by calling open in G — & any 
set of G — @ corresponding to an open set in G. (v.K. p. 450) 

B. Then if G is discrete (bicompact) and @ is a closed subgroup of G, G — @ 
is discrete (bicompact). (v.K. p. 450) 

By sG is meant the direct sum to s terms (s finite) 


G=G+G+... +4. 


If g' is an arbitrary element of G, the typical element of sG will be written in 
vector form as g = (g', ---,g°). To topologize sG call open in sG any set U of 
elements g such that g' C U'*, i = 1, ---, s, where U' is open in G: U = (U’, 
..., U’). To get the group operation in sG, if g’ = (g", --- ,g”) theng +g’ = 
+ g”, + g”). 

C. If Gis discrete, sG is discrete. If Gis bicompact, sG is bicompact. 

D. If G and FH are character groups and therefore paired, sG and sH can be 


paired by the rule 


gh = 


where h = (hi, --- , hs), hi € H, is the typical element of sH. It follows from 
A and B that sG and sH are character groups. 

E. If G is discrete or bicompact, H is its group of characters and © is a closed 
subgroup of H, @ = (G, §) and § = (H, @) are equivalent properties (P.B.; 
P.G.). 

F. If F is a transformation from sG to tG such that 


Fg',---,9°) 


where f' = bam a;;9’, a;; integers, then F is continuous. 
G. The subgroup A = {g} of sG such that Fg = Ois closed in sG. This follows 
from F and the fact that 0 is closed in tG. 
H. The set of elements F9, g ¢ sG, is closed in tG, G@ bicompact. This is because 
every continuous transformation of a bicompact space into a Hausdorff space 
carries closed sets into closed sets (A.H. p. 95) and sG is closed. 


2. Algebraic lemmas 


The following notation is now useful: 

a. © < G means G is a subgroup of G. 

b. © KG means G is a closed subgroup of G. 

c. When F < sH, F” shall be the set of elements (f,, --- ,f.) of oH for which 
(fi, +++ ,fe,0, --+ , 0) an element of F. 
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d. When F < sH, F* ts the set of elements (f, , --- , f.) of-oH such that (f, , «++ , 
€F. 

e. When F < o H, [F] ts the set of elements (fi, ---, fo, +++ fs) such that 
(fi, fe) €F and is arbitrary in H. 

{. When F < sH, [F] = [F*] 

g. H means and are isomorphic. 

h. If A < G, B < Gthen A — B is the set of all elements a + b inG, ae A, 
beB. (Obviously A B < G.) 

It is apparent that F < sH implies F” < F* < cH and that F < cH or F < sH 
implies [7] < sH. Immediate lemmas are: 

1. If F then F’ 

2. If G is discrete or bicompact and A, B, KG, then A B&G. 

3. If H is discrete or bicompact and F & sH, then F’ K F* K oH. 

4. If F<oH or F K sH and H is discrete or bicompact, then [F|] K sH. 

TueoreM 2.1. If G and H are character groups, one being discrete, the other 
bicompact and if F < sH, then (sG, F)’ = (0G, F*). 


Let g = (g', +49", and g’ = 9); f = fe). If 
g' F*) then g'f: = Ofor every fe F, and Dini = for 
everyfeF. So forg” = (g', ---, 9%, 0, ---, 0) = Ofor every fe F. 


Hence g’’ is in (sG, F) and the corresponding g’ therefore is in (sG, F')’, so 
(sG, F)’ D (oG, F*). 

If g' (sG, F)’ then = 0 for every f F and hence for every f F*. 
Therefore g’ € (oG, F*) and (eG, F*) > (sG, F)’ which completes the proof. 

THEOREM 2.2. If Gand H are character groups and F < aG, then, (sG, [F]) ~ 
(oG, F). 

Let I be the isomorphism I(g',---,g’, 0,---,0) = @,---,9%). If 
(9°, € (oG, F) then Lint g'fi = 0 for every (fi, fo) eF, and 
Ig’, , 9°) € (sG, [F]). On the other hand, if (g’, --- , g*) 


(1) hi+ LD =, 

i=] j=o+1 
where (f,, +--+, fe) €F and are arbitrary elements of H. Setting 
Jory = f, = 0 gives 
(2) fi 0, 

i=1 

s0 (1) implies = 0 for every choice of , fe. Suppose that 
9°"’ ¥ 0 for the element g = (g', --- , 9°) of (sG, [F]). Then by A there is an 


fexieH such that 0. As element of [F] choose f = (fi, --- , 
where (f,,+-+,fo)eF and fj = 85"feri, 0 <j 8. Then by (2), of = 
9 fori # 0 and g is not in (sG, [F]). So all elements of (sG, [F]) are of the 
(', 0, --- ,0) with g‘f; = 0 by (2). Hence F) D 
[F)). 
TunoreM 2.3. If F < sH then (sG, F)’ ~ (sG, [F)). 
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By theorems 2.1 and 2.2 and the definition of [F], (sG@, F*) 
(sG, [F*]) = (sG, [F)). 

TuroreM 2.4. If A, B < G, thn G — Av B = (@ — A) — B* where 
is the set of elements of G — A (cosets) such that each coset in B* contains an 
element of B. 

The proof depends on showing that the necessary and sufficient condition 
that two elements, g and g’ of G be in the same coset of (@ — A) — B* is that 
g—g 

TuHeoreM 2.5. If G and H are character groups and A, B < G,C,D <H 
and 1.A = (G, C), 2. B = (G, D); then A ~ B = (G, C — D), where A ~ B 
a is the set common to A and B. 

Suppose ze A ~ BandceC,deD. Then + d) = ac + ad = 0 by (1) 

and (2) sincere A andzeB. Soxe(G,C D) and A~BC(G, 

Suppose (G,C D). Then + d) = 0 for everyceC anddeD. Take 
d = 0; then xc = 0 for every ceC and so by (1) xe A. Similarly xe B so 

THEOREM 2.6. If G and H are character groups of one another, one being 

discrete, the other bicompact and 


i TKO<KG 
HKH 
(1) ii. G = G, 8) 

(2) = (CH, 


then it follows that X = © — T and Y = § — 6 are character groups. 
Property D gives 

(3) 6 = (H, ®) 

(4) r = (G, §). 

Now pair the cosets of X and 7 of Y by means of the pairing zy, x e G, ye H 
which exists because G and H are character groups. If z, x’ belong to &, and 
y, y’ belong to 97, x — a’ eT, y — y’ €6. Since ye §, (2) gives (x — 2’)y = 0, 
so xy = 2’y. Again, since x’ C G, 2’(y — y’) = 0 by (1) so a’y = 2’y’, and 

- finally zy = x’y’. This makes it legitimate to define 


(5) ig = xy where ved, 


The two distributive laws hold for #7 since they do for xy, so X and Y are 
paired by (5). 

Asking for those @ for which #7 = 0 for every 7 € Y is equivalent by (5) to 
asking in which coset of © — IT x must lie in order that zy = 0 for every y € ©. 
Since x eG < G, it follows from (4) that x eT, so is the identity of X and 
(X, Y) = 0. Similarly (Y, X) = 0. By B one of the pair X, Y is discrete, the 
other bicompact so by A they are character groups of one another. 
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3. Chains of the complex K 


Let K be a complex made up of a finite number of elements (cells) and having 
its incidence relations given by matrices M’’”* = || u?; || whose elements are 
integers (including zero). The matrix M »?~! determines the incidences between 
the p-dimensional elements (rows, first subscript) and the (p — 1)-dimensional 
elements (columns, second subscript). The matrices have the property 
= 0. 

If K has s, p-dimensional elements, and a subcomplex L has o, p-dimen- 
sional elements, number these elements so that the first ¢, are in L. Then 


(1) u?}; = 0 when io, and j>o,1 simultaneously 


because every (p — 1)-element incident with a p-element of L is itself in L by 
definition of subcomplex. 

If G is an Abelian group which is either discrete or bicompact, the group of 
p-G-chains of K is defined to be s,G@, and that of L is o,G. The transforma- 
tion, 


Bate 
is a continuous (see F) homomorphism of s,G into s,1G. The subgroup 
”2°(K) of elements g € 8,G with ag = 0 is called the group of p-G-cycles of K. 
That ’3°(K) is closed follows from G. 
The subgroup ’§°(K) composed of the elements ag of s,G (@ now a trans- 
formation from sp4:@ into s,@) g € Sp4iG, is called the group of p-G-cycles bound- 
ing in K. That ’§°(K) « sG@ follows from H. Notice that 


d0g = (d’,---,d"") for g€ &pyG, 


where 


i=1 j=1 


since the product of the incidence matrices is zero. So: 

THEorEM 3.1. ’§°(K) ’3°(K) 8,)G. 

Corotuary. ?°(K)” « ’3°(L) (See 2c). 

If g is a chain of ?§°(K)” it is a chain of L and I”'g is a chain of ’§“(K) so 
by theorem 3.1 ag = 0. The part of ag on L is therefore zero and g e’3°(L). 

Let H be the character group of G. Then H is either bicompact or discrete. 
By a p-H-chain of K is meant an element (h;, --- , hs,) of s,»H, and similarly 
for L. The continuous homomorphism 


dh = (ki, 


ie 
+ 
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+1 . 
= h; t= 1, 


j=1 


carries s,H into 84H. Define ’3x(K) to be the subgroup: hes,H and 
dh = 0; ’§x(K) to be the subgroup 6h, hes,.H. The first is the group of 
| p-H-cocycles; the second the group of p-H-chains cobounding in K. As before, 


55h = (ti, ts,4,) Where hesp.1H and 


ur = 0, and: 


THEOREM 3.2. "§u(K) s,H. 
In future s,, ¢» will be abbreviated to s, ¢ 
TurorEM 3.3. ’§°(L) ’F°(K)’ oG 
a) < | 
If f = (f',---,f) there are in G that f’ = 
j= 1, +++, 9. Also = (f', ---,f7,0, Let g = 
(g', whereg’ = > op. Then = +0. 
When j So ther.h.s. of this equation is f. Whenj > o ther.h.s. is zero by (1). 
Hence dg = I-“f and I’’f is in ?§°(K) as well as in oG, so f e?§°(K)’ and a) 
follows. 
b) Theorem 3.1 gives ’¥*(L) «K oG and theorem 3.1 and lemma 3 give 
oG. 
This follows from theorems 3.1 and 3.3. 
| THEOREM 3.4. K ’3n(L) & oG. 
a. "Bn(K)* < Ifk = (hh, ---,he, he) andh’ = (hh, --- 
then h’ implies h which gives 


wth, = 0, 


j=1 


hy + = 0. 


So 


Diu? th; = 0 by (1) when 7 S oy41, and hence h’ € ?3x(L). 


b. Theorem 3.1 gives ’3x(L) « cH and theorem 3.2 and lemma 1 give 
oH. 

This follows from H and theorems 3.2 and 3.4. 

Two formulas from a paper by Whitney’ are now needed to give the next two 
theorems. 


* Whitney: On Matrices of Integers and Combinatrial Topology, Duke Journal, 3 (1937) 
pp. 35-45. See theorem 1 p. 36 and p. 40, (6.6) and (6.7). 
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THEOREM 3.5. 

THEOREM 3.6. 

THEOREM 3.7. 

By theorem 3.5, "§°(K)” 
"3u(K)*). 


(K) 


(sG, ’3u(K)). 
(sH, ’3°(K)). 
(0G, 3x(K)*). 
(sG, ’3n(K))’. But by theorem 2.1, ’§°(K)’ = 


TuroreM 3.8. ’§°(K)’ ?3°(L) = (0G, ’3n(K)*). 
This is a consequence of 3.5 and E, 3.7 and 2.5. 


If G and H are character groups, one being discrete, the other bicompact, 
groups involving homologies and groups involving cohomologies are paired in 
three theorems. 

4.1. ’9°(K) = °3°(K) — ’§°(K) and ’Sx(K) = ’3x(K) 
’¥u(K) are character groups. 


Turorem 4.2. K) = — 


?2,(L) — are character groups. 

Since "§°(K)’ (theorem 3.1, corollary) ’§°(K)’ ~ ’3°(L) = 
so K) = "§°(K)’ — 

TuroreM 4.3. ’©°(L, K) and Y = ’3n(L) — © ’3Bx(K)*] are 
character groups. 

The proofs of these theorems are by means of B, C, D and theorem 2.6. The 
identifications made to obtain the notation of 2.6 are shown in the following 
table, as are the theorems giving conditions i and ii of 2.6. 


4. Homologies and Cohomologies 


"§°(L) and K) = 


Thm. 2.6 4.1 4.2 4.3 
G sG oG oG 
H sH oH oH 
?23n(L) 
§a(K) ?3u(K)* "§a(L) ~ *Ba(K)* 
Y= §-¢ ’Ox(L, K) Y 
'r<G<e 3.1 3.3 3.3, Corollary 
| 1KS KH 33 3.4 3.4, Corollary 
 6=Ge 3.5, E 3.7 3.8 
9 = (H,T) 3.6, E 3.6, E 3.6, E 


¥ 
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| 
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Theorem 4.1 is the well-known duality theorem for the homology and co- 
homology groups of a complex. Theorem 4.2 shows that the group of p-G-cycles 
of L bounding in K, mod those of them bounding in L has for character group 
the group of p-H-cocycles of L, mod those of them which are the part on L of cocycles 
of K. 

THEOREM 4.4. K) © — *An(L, K), where ’Ay(L, K) is the 
group of cosets of ’S(L) each of which contains the part on L of a cocycle of K. 

By theorem 4.3, Y being the character group of ’°(L, K) is isomorphic to 
’Ox(L, K). Then by theorem 2.4 Y = ’Ox(L) — "An(L, K) where ’A,(L, K) 
is the B* of that theorem. 


CoRNELL UNIVERSITY. 
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ON THE THEORY OF STATIONARY RANDOM PROCESSES 
By Haratp CraMER 
(Received May 5, 1939) 
I. 


1. Let ¢ be a real parameter which may be regarded as representing time and 
let, for every t, X(t) denote a real-valued random variable. 
Following Khintchine [6], we shall say that X(é) is attached to a one-dimen- 


sional random process if, for any finite set of real values ¢,, --- , &, the simul- 
taneous probability distribution’ of the variables X(t), .-- , X(&) is known. 
This distribution may, e.g., be characterized by means of the corresponding 
k-dimensional distribution function F(a, , --- , 54, , t&), which represents 
the probability of the set of inequalities 
X(t) (vy = 1,2, --- , k). 
F is a distribution function in the variables 2,,---,2,, with 4,---,& as 
parameters. 
Any system of distribution functions F(a, &,---,t), where 


k = 1, 2,--- , such that every F is symmetric in all pairs (2, , ¢,) and satisfies 
the consistency relation 


for any j < k, defines a random process. 

For certain purposes this definition is not complete enough, and Doob [3] 
has given more elaborate definitions which meet the requirements of these cases. 
In the present paper we shall, however, only deal with problems where the 
simple definition given above is sufficient. 

The definition is immediately extended to an n-dimensional process, if an 
n-dimensional random variable Z(t) = {X,(#), --- , Xn(é)} is chosen as the basic 
variable of the process. The simultaneous distribution of the variables Z(t), 
--- , Z(t) is then characterized by a distribution function in a space of nk 
dimensions, which represents the probability of the set of inequalities 


X,(b) S Lp, (u = 1,2,---,n;» = 1,2,---,k). 


The corresponding extension of the conditions of symmetry and consistency is 
obvious. 


‘Cf the writer’s Cambridge Tract [1] on ‘(Random Variables and Probability Dis- 
tributions.’”’ 
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Finally, we may evidently assume the n components X,(¢) of the basic variable 
Z(t) to be complex random variables 

(1) X, (0) U,(t) +1 V, (4), 


and so obtain a complex multi-dimensional process. The simultaneous distribu- 
tion of Z(t), --- , Z(&) is then characterized by a distribution function F in a 
space of 2nk dimensions, which represents the probability of the set of inequalities 


S Uw 

S , 
where » = 1, 2,---,n;v = 1,2,---,k. F is a distribution function in the 
arguments u,, and v%,, , With t; , --- , 4 as parameters. 


2. Consider a random process defined by 


where, in the general case, the X,(¢) are complex random variables defined 
according to (1). It evidently amounts to the same thing if we consider simul- 
taneously the n processes defined by the components X;(t), --- , Xn(t), and we 
shall apply these two points of view without distinction in the sequel. 

We shall always assume that the mean values E{| X,(t) |?} are finite for all 
wand ¢t. It then follows that the mean values E{X,(t)} and E{X,(t)X,(u)} 
are finite for all uw, v, t and wu. 

Generalizing a definition introduced by Khintchine [6]’, we shall say that 
Z(t) defines a stationary random process, if the following two conditions are 
satisfied for all u and v: 


(A) E{X,(t)} = m, is independent of t; 
(B) E{\X,(®)X,(u)} = Ry(t — u) ts a function of the difference t — u. 


If these conditions are satisfied, we shall also say that the n processes defined 
by the variables X,(d), --- , X,(t) are stationary and stationarily correlated. 
Obviously we may without restriction of generality assume 


E{X,()} = m, = 0, 
E{| X,() R,,(0) >0 


for all 4. This being assumed, we shall denote the R,,(¢) as the correlation 
functions of the stationary processes attached to the variables X,(t), --- , Xn(t). 
More particularly, we shall call R,,(t) the autocorrelation function of the process 
defined by X,(t), while R,,(t) and R,,(¢) will, for » ¥ v, be called the mutual 


(C) 


? Khintchine introduces also another, narrower, concept of stationarity which, however, 
we shall have no occasion to use in the present paper. 
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correlation functions of the processes connected with X. and X,(t). It follows 
from (B) that we always have 
(2) R,,(—1) 


In the particular case when the variables X,(¢) are real, R,,(t)/(o,0,) is evi- 
dently identical with the ordinary correlation coefficient of X,(% + ¢) and 


X, (to). 
We shall finally assume that 
(D) lim Ryp(t) = = 
foru = 1, 2,---,m. It will be shown later (Lemma 1) that, if this condition 


is satisfied, all the correlation functions #,,(¢) are continuous for all real values 
of t. The process defined by each X,(t), as well as the one defined by the com- 
posite variable Z(t), will then be called a continuous stationary random process. 
This concept has been introduced by Khintchine [6], who has justly emphasized 
its great importance for various branches of applications. 


3. The main purpose of the present paper is to study the mutual relations 
of the correlation functions R,,(t). In Chapter III, we shall deduce a necessary 
and sufficient condition that a set of n’ a priori given functions R,,(t) may be 
correlation functions of n complex random processes satisfying the conditions 
(A)-(D). It will be shown to be necessary and sufficient that the R,,(t) 
should, for all real ¢, be given by Fourier-Stieltjes integrals of the form 


Rel) = 


where the F,,(a) are functions of bounded variation in (— ~, «) which satisfy 
certain conditions as specified by Theorem 1. In particular, it follows from 
these conditions that the functions F,,(x) corresponding to the autocorrelation 
functions R,,(¢) are real and never decreasing, with a total variation equal to 
o, , as already proved by Khintchine [6]. 

In Chapter IV, some further properties of the correlation functions will be 
deduced, and the particular case of random processes defined by real variables 
X(t) and X2(é) will be considered. Finally, in Chapter V we shall show that an 
analogous theory may be developed for the case when our random processes are 
considered for integral values only of the parameter t. 


II. Auxm1ary THEOREMS 


4. In this Chapter we shall, for the sake of convenience, give as Lemmas a 
number of results (partly previously known) wanted in the sequel. In the first 
place, we shall prove the following result indicated in the Introduction. 

Lemma 1. Let R,,(t) be the n° correlation functions of n random processes 
satisfying the conditions (A)—(D) of paragraph 2. Then R,,(t) is continuous for 
all real t, and we have 


(3) | | S 


|| 
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The inequality (3) follows immediately from (B) and (C) by means of the 
Schwarz inequality. Further, we have 


+ Al) Rw(t) = E{XO)[X(t + At) — 
and thus by the Schwarz inequality 
| Ru(t + At) — Rw(t)| S Xt + At) — 
= 202 — R,,(At) — Ry(—At)- 
According to (D), this tends to zero with At. 


5. We next consider some convergence properties of sequences of random 
variables. If X,, X2,--- and X are real or complex random variables, we shall 
say that X,, converges in the mean to X, and write 
lim. X, = X, 


whenever the mean value 
E{|X, — 


is finite for all sufficiently large n and tends to zero asn— ~. 
If the mean value 


E{| Xm — Xu 


tends to zero as m and n tend independently to infinity, then it is known’ that 
there is a variable X such that X, converges in the mean to X. If, in addition, 
; E\| X,, |’} is finite for all sufficiently large n, it immediately follows that E{| X |°} 
is finite. Further, if X’ and X” are variables such that X, converges in the 
mean both to X’ and to X”’, then X’ and X”’ are equivalent, i.e. 


Pr{X' = X"} = 1. 


Lemma 2. Let X,, --- and Y,, --- be two sequences of random 
variables such that E{| X,, |’| and E{\ Y,, \"} are finite for all n. Suppose that the 
mean values E{| Xm — X,\|"} and E{| Yn — Y, |"} both tend to zero as m and n 
tend independently to infinity, so that there are two variables X and Y such that 
E{| X |"} and E{| Y |’} are finite, and lim. X, = X, lim. Y, = Y. Then 
E(X,) > E(X), E(Y,) E(Y), 


E(XY). 


° Cf. e.g. Fréchet [4], Lévy [7].—It is also known that convergence in the mean implies 
convergence “‘in probability,’’ which is expressed by the relation Pr{|X, — X | > «} -0 
for any « > 0. 


| 
| 
| ™ 
| 
| 
| 


ON THEORY OF STATIONARY RANDOM PROCESSES 219 


In fact, we obtain by means of the Schwarz inequality 
| H(X,) — E(X) — X |} E{|X, — X 


and 
[E{|X, — E{| X PHP = — | X|)(| 


< E{(| — | X |)*}-Et(| X 
E{|X, — X 
< 2E{|X, —X 


Hence we immediately conclude that H(X,,) E(X) and E{| X — E{| X 
Applying the last result to the sequences X, + Y, and X, + 7Y, , we find that 
E(Xn¥n) > E(XY). 


6. In the sequel, we shall repeatedly have to deal with integrals containing a 
random variable attached to a continuous stationary random process. Integrals 
of this kind may be very simply defined in complete analogy to the elementary 
definition of an ordinary Riemann integral, as shown by the following Lemma. 

Lemma 3. Let X(t) be a (complex) random variable, attached to a continuous 
stationary random process, and let g(t) be a continuous function of tfora St S b. 
Consider a sequence of random variables Z, , Z2 , --- , defined by 


n 
v=] 


wherea = < <... < < = bd. Suppose that 
Max — i§"}) 


asn—» ©. Then Z,, converges in the mean to a random variable Z. Further, 
if {Z2} is another sequence formed with the same X(t) and ¢(t), but with another 
system of points tS” satisfying the above conditions, and converging in the mean 
to the limit Z*, then Z* and Z are equivalent. For the limit Z we write, by definition, 


b 
[ v(t) X (0) dt, 


so that the last integral is a random variable, which is hereby completely defined 
but for equivalence. 

It is evident that we may assume E{X(t)} = 0. 

Putting 


Z' = p> — ta), 


= o(t, )X (t) — 1), 


| 
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it will obviously only be necessary to show that the mean value E{| Z’ — Z”’ |* 
can be made as small as we please by taking the length 6 of the maximum 
interval in the two divisions corresponding to the points t, and t,” sufficiently 
small. In fact, this point being established, it directly follows that any sequence 
{Z,,} or {Z,} converges in the mean, and that the mean value E{| Z, — Z* |*} 
tends to zero. Hence the limits Z and Z* must be equivalent. 

From the current text-book theory of the ordinary Riemann integral, it is 
well known that we may restrict ourselves to the case when the division points 
t} (v = 0, 1,---,m) are all contained among the points t, (v = 0, 1, --- ,n). 
This being assumed, we may write 


where 
| t — 4 | <6. 


Denoting by R(t) the autocorrelation function of our continuous stationary 
process, we then have 


E{|Z’ — 


=D fer) — 14) — — tea) — 


g(t) and R(t) being both continuous functions, it easily follows that the last 
expression can be made less than any given e by taking 6 sufficiently small. 
Thus the Lemma is proved. 

It is easily shown that the elementary properties of the ordinary Riemann 
integral hold for the integral of a random variable as defined by Lemma 3. 
Integrals of this type are themselves random variables, and we shall have 
occasion to consider in the sequel certain mean values connected with such 
variables. In this respect, we shall require the following Lemma. 

Lemma 4. Let X,(t) and X(t) be random variables attached to continuous sta- 
tionary processes satisfying the conditions (A)—(D) of paragraph 2, and let ¢,(t) 
and yo(t) be continuous functions of tfora St Sb. Then 


(4) BY [ gi(t)Xy(t) dt- [ y2(u) Xo(u) au} = [ Rig(t — u) dtd. 
In particular, for X,(t) = X2(t), v(t) = we have 


[ gi(t) de } = [ — u) dt du. 


| 
| 
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The last expression, being the mean value of a real and non-negative variable, is 
evidently real and non-negative. 

In order to prove this-Lemma, we have only to express the integrals in the 
first member of (4) as limits of finite sums of random variables according to 
Lemma 3. The mean value of the expression thus obtained is equal to a 
Riemann sum corresponding to the double integral in the second member of (4). 
An application of the last relation of Lemma 2 then immediately yields the 


desired result. 


III. Matn THEOREM ON THE CORRELATION FUNCTIONS 


7. We now proceed to the proof of the following theorem, which constitutes 
the main result of the present paper.‘ 

TueorEM 1. a) Let X,(t), --- ,X,() be complex random variables attached 
to n continuous stationary processes satisfying the conditions (A)—(D) of para- 
graph 2. Then the correlation functions defined by (B) are for all real t given by 
Fourier-Stieltjes integrals of the form 


6 = 


where the F,,(x) are functions of bounded variation in (—~, ©), which we may 
always assume to be everywhere continuous to the right. Further, denoting by 
AF,, the increase F (8) — Fyuy(a) of Fy»(x) over the real interval a S x S B, the form 


(6) Hla, eet, Zn) yy 

is for any closed interval (a, B) a non-negative Hermite form. (In particular, 
putting all the z-variables except z, equal to zero, it follows that F(x) ts real and 
never decreasing. ) 

b) Conversely, let Ry(t) be a set of n° functions given by the Fourier-Stieltjes 
integrals (5), the functions F,,(x) being of bounded variation in (—», ©) and 
everywhere continuous to the right, and suppose that the form (6) is, for any closed 
real interval (a, B), a non-negative Hermite form. Then it is possible to find n 
continuous stationary random processes having the correlation functions R,,(t). 


8. In order to prove the first part of the theorem, we shall begin by deducing 


the expression (5) for the correlation functions. We put, for any real x and 
any A > 1, 


(7) Q(x, A) = [ [ — we 


* Professor A. Kolmogoroff has kindly informed the author that the same theorem has 
been found by him. 
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According to Lemma 1, R,,(¢) is a bounded and continuous function, and it is 
then known’ that a necessary and sufficient condition for a representation of 
R,,(t) of the form (5), where F,,(x) is of bounded variation in (— ©, ), is 


(8) [ | gu(2, A) |dx < const. 

for all A > 1. Moreover, a necessary and sufficient condition for a representa- 
tion of the form (5) with a real and never decreasing F,,(x) is that g,,(x, A) 
should be real and non-negative for all real z and all A > 1. If the latter 


condition is satisfied, then (8) holds a fortiorv. 
Now according to Lemma 4 we have 


1 —izt tzu 
X, (be at. | X,(uje au 


1 
[ [ Rw(t — dtdu = gy(a, A). 


which shows that g,,(x, A) is real and non-negative, so that we have 


(9) 


Thus in particular we have for u = » 


a 


R,,(t) = [ e dF (x), 


where F,,(x) is real and never decreasing. From | R,,(t) | < o% it then follows 
(Cramér [2], formula (6)) that 


(10) A) dx So}. 
On the other hand, we obtain from (9), using the Schwarz inequality, 


[ ‘ X, dt. X,(u)e™ au} 


(2A)? 


Juul, A) A). 


Hence we deduce by (10), using once more the Schwarz inequality, 


| A) |? 


A) | de eyes. 


5 Cramér [2]. 
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Thus the condition (8) is satisfied, and the representation (5) is established. 
Obviously, we may always assume that the functions F,,(x) are everywhere 
continuous to the right, since a modification of F,,(x) in an enumerable set of 
points does not affect the value of the integral in the second member of (5). 


It remains to prove the assertion concerning the form H(z, , --- , zn) defined 
by (6). Obviously it is only necessary to show that, for arbitrary complex 
%,-°::,%n, this form is always real and non-negative. 

Let (a, 8) be a given real interval and put 

itp 
- 
g(t) = 


According to Lemma 4, the mean value 


} 
exists for any M > 0 and for arbitrary complex z,, and is always real and 
non-negative. We have, however, by Lemma 4 


Hula, Zn) = g(t) X,(t) af o(u)X,(u) au) 


au) = Daz watdu 


Let us first suppose that a and 6 have been chosen such that the functions 
F,,(z) are all continuous for x = a and for x = 8. Given any e > 0, the ex- 


pression 
dt = sin t(x — a) dt sin — 6) dt 
M Jo t Jo t 
tends for indefinitely increasing M to the limit zero, uniformly for x < a — ¢ 
and for > B + and to the limit unity, uniformly fora + «<2 
In the remaining intervals | — a| S e and |z — B| S «, we have 


M 
[ g(t) dt 
M 
It then follows from (6) and (11) that we have 
lim Hula, y Zn) = Zn). 
Hu(z, +--+ ,2n) being, according to the above, real and non-negative, the same 
holds true for H(z,,---,2n,). As the functions F,,(xz) are everywhere con- 


tinuous to the right, this result is immediately extended to arbitrary real values 
of a and 8, and the first part of the theorem is proved. 


< 2. 


| 
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9. Let R,,() be a set of n° given functions satisfying the conditions of the 
second part of the theorem. We then have to show that we can find n con- 
tinuous stationary random processes having the functions R,,(¢) for their corre- 
lation functions. It will, in fact, be shown that n processes with the given 
correlation functions can be defined by random variables of the form 


(12) = U,(t) + (u = 1, 2,--. ,n) 


in such a way that the simultaneous probability distribution of the 2nk real 
random variables 


(13) Uultr), Vults); 


is always a normal distribution. 

A normal distribution in any number of variables is completely defined by 
its first and second order moments. The functions R,,(¢) being given, we now 
define the normal distribution of the 2nk variables (13) by putting 


E{U,(t,)} = E{V,(t,)} 0, 


= ts) + t,) 
4 ’ 


— R(t, ts) + Ry lts t,) 

47 
From our hypotheses concerning the given functions R,,(é), it easily follows 
that the relation (2) is satisfied, so that the second order moments defined by 
(14) are real. In order to show that these moments really define a normal 
distribution in the variables (13), it is sufficient to show’ that the quadratic 
form Q(u, v) in the 2nk real variables u,; , v4, , defined by 


+ Vur Vys) ane t,) : Ry (ts tr) 


E{ Vi(ts)} = 


+ (Upr dvs — Uys Vur) 


47 


is non-negative. If this is so, the moments (14) define a normal distribution 
with the characteristic function 


Using (2) we obtain, however, by some easy transformations 


Q(u, v) = 2 (Uyr Wyr) (Uys + — t,). 


6 Cf. Cramér [1], p. 199. 
7 Cf. Cramér, l.c. 
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the Substituting here the expression (5) for R,,(t), we obtain further 
re 
ven 
where 
0) 2 = 2,(z) = — 
eal It now follows from our hypothesis concerning the form (6) that Q(u, v) is never i 
negative. 
k), Thus the moments (14) define a normal distribution in the 2nk variables (13). ' 
Evidently this distribution satisfies the necessary symmetry and consistence 
conditions, so that the corresponding complex variables (12) define a multi- 
by 
itd dimensional random process. From (14) we obtain, however, 
= + Vall) — Vt) + Ut) 
= — t). 
Since ¢, and ¢, are arbitrary, this shows that the n random processes defined f 
by the variables (12) are stationary and stationarily correlated processes, the 
correlation functions of which are identical with the given functions R,,(¢). 
Thus our theorem is completely proved. 
ws IV. FurTHER PROPERTIES OF THE CORRELATION Functions. THE Case 
by or REAL PROcEssES 


10. Let us first observe that the form H defined by (6) is non-negative and 
Hermitian for any interval (a, 8) if, and only if, the same holds true when 
AF,, denotes the total increase 


of F(z) over any finite or enumerable set of non-overlapping closed intervals 
(a,, B,). Thus Theorem 1 remains valid if we introduce this more general 
} definition of AF ys. 

Without restriction of generality, we may assume F,,(— ©) = 0. Any F,,(x) 
being of bounded variation in (— ©, ©) and everywhere continuous to the 
right, it is known that its points of discontinuity xz, form at most an enumerable 
set, that a finite derivative F,,(2) exists almost everywhere, and that we may 
write for all z 


(15) F(t) = FO (2) + FO (a) + FY (2). 


Here 


n 


F(a) = [ Felt) at 
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(the integral being taken in the Lebesgue sense) is the absolutely continuous 
component of Fry ; 
F (x) Pr 

is the discontinuous component (p, being the saltus of F,, at x = 2,); finally 
(x) is the singular component. F,;)(x) is everywhere continuous and has 
almost everywhere a derivative equal to zero. There is a set Z of measure 
zero such that the total variation of F(x) over Z is equal to the total variation 
of (x) over (— 2%, ©). The three components in the representation of F,,(z) 
according to (15) are uniquely determined by F,,(z). 

Substituting the expression (15) for F,,(x) in the definition (6) of the form 
H(z, +--+ ,%n), we obtain H = H, + Ha + H, , where 


Hila, +++, = 2,2, AF 
and similarly for Ha and H,. We then have the following theorem. 

THreoreM 2. If the form H is, for any interval (a, B), a non-negative Hermite 
form, then the same holds true for each of the forms H,, Haand H,. The converse 
is obvious. 

Denoting by x any point such that all derivatives F/,(x) exist, and observing 
that, by hypothesis, the form H corresponding to the interval (x, x + dz) is 
non-negative, we find that 

By 
is, for almost all z, a non-negative Hermite form. Integrating between the 
limits a and 8, it follows that the same holds true for H,(z,, --- , Zn). 

Similarly we find, denoting by p,, the saltus of F,, in a certain point x and 
considering the form H corresponding to the interval (2 — «, x + e), that the 
form is non-negative Hermite form. Adding together the analo- 
gous forms for all discontinuities x belonging to (a, 8), we obtain the desired 
result for Ha. 

Finally, it is seen without difficulty that, to any closed interval (a, 8) and 
any given e > 0, we can find a sub-set EF of (a, 8) consisting of a finite or 
enumerable number of non-overlapping closed intervals of total length < «, 
such that the total variation of any FS; over E differs by less than ¢ both from 
the total variation of F{) over (a, 8) and from the total variation of F,, over E. 
Denoting by AF,, the increase of F,, over E, the corresponding form H is, 
according to the above, a non-negative Hermite form. Allowing now e to tend 
to zero, we obtain the form H, corresponding to the interval (a, 6), and the 
proof of the theorem is completed. 


11. Consider now the particular case n = 2 of our previous theorems. We 
then have two continuous stationary processes, defined by the random variables 


n 
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X,() and X,(é). For n = 2, the form H defined by (6) is a non-negative Her- 
mite form if, and only if, Fy, and F22 are real and never decreasing, AF; = AF , 


and 
(16) | AF |? AF y,- AF 


From Theorems 1 and 2, we then immediately obtain the following result. 
TueorEM 3. Four given functions R,,(t), (u, » = 1, 2), are the correlation 

functions of two continuous stationary processes if, and only if, the R,,(t) are 

given by the Fourier-Stieltjes integrals (5), where Fy, and F22 are real and never 


decreasing, = Fy», and 


(17a) |AFY? AF{? 
(17d) | AF? ? < 
(17s) |AF{? |? 


for any interval (a, B), the notation being the same as that used in connection 
with (15). 
The condition (17a) is obviously equivalent to the condition that almost 
everywhere 
Further, (17d) is equivalent to the condition that the discontinuity points of 
F\, form a sub-set of the set of points which are discontinuity points both of 
Fy and of Fx, and that the saltuses p,, corresponding to any discontinuity 
point satisfy the inequality 
| |? S Pu Poe. 
Finally, it should be observed that the three conditions (17) are equivalent to 
the single condition (16). 


12. Let us now assume that the two random processes considered in the 
preceding paragraph are real, i.e. that they are defined by two real random 
variables X,(¢) and 

Any correlation function R,,(é) being in this case real, its expression as a 
Fourier-Stieltjes integral may be written 


Ry(t) = dF 


[cos tx df,»(x) + sin tx dg,,(x)], 


where f(z) and g,,(#) are real functions of bounded variation in (0, ©). We 
may, in fact, take 
Sw(x) = + Fw(—2)], 


= + F,,(—2)]. 
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Since F's, and Fe are real and never decreasing, we then have gu = gx = 0, 
while f,, and fo are never decreasing. f,, and g,, may be resolved into com- 
ponents of the types (a), (d) and (s) in the same way as F,, according to (15). 
Theorem 3 now immediately yields the following result. 

THEorEM 4. Four given functions R,,(t), (u, v = 1, 2), are the correlation 
functions of two real continuous stationary processes if, and only if, the R,,(t) are 
given by the expressions 


Ry (t) = cos tx dfi(zx), 


where the f,, and g,, are real functions of bounded variation in (0, ~) such that 
fu and fo. are never decreasing, while 

(18a) (Afi?)’ + Afi? Afie’, 

(18d) (Afi)* + Aft? -Afz’, 

(18s) + (Agi)? Aft? -Afs’, 

for any interval (a, B) withO <a < B. 


As in the preceding paragraph, the conditions (18a) and (18d) are equivalent 
to the conditions 


(fia)’ + (giz)? S 


and 


respectively, p,, and q,, denoting here the saltuses of f,, and g,, at any given 
point 2. 

Finally, the three conditions (18) are evidently equivalent to the single con- 
dition 


(Afi2)” + (Agiz)” S Afu-Afes . 


V. On Discrete SraTionary PRocessss 


13. Let us consider a sequence of (real or complex) random variables X(t), 
where t = 0, +1, +2, --- , such that the simultaneous probability distributions 
of the variables X(t,), --- , X(t) are determined for any finite set of integral 
t,---,t. If these probability distributions satisfy the conditions of sym- 
metry and consistency stated in paragraph 1, we shall say that the sequence 
X(t) defines a discrete random process. 
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Let, now, the variables X;(é), --- , Xn(¢) define n discrete random processes 
such that for all integral values of ¢ and u 


E{X,()} = 0, 


(19) E{X,()X,(u)} = — u), 


id so that the latter mean value is a function of the difference ¢ — u. We shall 
then say that the n processes defined by the X,(¢) are stationary and stationarily 
correlated. For the general theory of the discrete stationary process, reference 
may be made to Wold [8]. 

In this case, our correlation functions R,,(t) become sequences of numbers, 
depending on the (positive or negative) eet reteed variable ¢. Without 
restriction of generality we may assume 


(20) Ry(0) = = 1. 


The quantity R,,(t) is then, in the particular case when X,(t) and X,(¢) are real, 
ul identical with the ordinary correlation coefficient between X,(& + ¢) and 
X,(#). Even in the case of complex variables X,(¢), we shall call the R,,(¢) 
the correlation coefficients of our n processes. For u = », R,,(t) yields the 
autocorrelaiion coefficients of the process defined by X,(t), while R,,(¢) and R,,(t) 
constitute, for 4» # », the mutual correlation coefficients of the processes connected 
with X,(¢) and X,(¢). We obviously still have R,,(t) = R,,(—?). 

For the correlation coefficients of a group of n discrete stationary processes, 
we can prove theorems which are completely analogous to our above theorems 
for the continuous case. In fact, the only modification required is to replace 
in the Fourier-Stieltjes integral (5) the infinite interval of integration by an 
interval of length 2x and, at the same time, to restrict the variable ¢ to integral 
values. Theorem 1 will then be replaced by the following theorem. 

THEorEM 5. a) Let X(t), ---, Xn(t), where t = 0, +1, +2,---, ben 
sequences of complex random variables attached to n discrete stationary processes 
satisfying the conditions (19) and (20). Then the correlation coefficients R,,(t) 
defined by (19) are for all positive and negative integral t given by the Fourier- 
Stieltjes integrals 


where the F,,(z) are functions of bounded variation in (—x, m), which we may 
always assume to be everywhere continuous to the right. Further, the form 
H(2,,--- ,2n) defined by (6) is, for any closed interval (a, 8) with —x S a < 
8 S x, a non-negative Hermite form. (In particular, it follows that F(x) is real 
and never decreasing, with a total variation equal to unity.) 

b) Conversely, let Ry(t), for t = 0, +1, +2, ---, be a set of n° sequences of 
quantities given by the Fourier-Stieltjes integrals (21), the functions F,,(x) being 
of bounded variation in (—x, x) and everywhere continuous to the right, and 
suppose that Ry(O) = 1 and that the form H(z, --- , 2n) 18, for any closed interval 
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(a, B) with —w S a < B S zw, a non-negative Hermite form. Then it is possible 
to find n discrete stationary processes having the correlation coefficients R,,,(t). 
The proof of this theorem is quite similar to the proof of Theorem 1. We 


only have to replace the definition (7) of the auxiliary function g(x, A) by 
1 a —iz(t—u) 


According to Hausdorff [5], a necessary and sufficient condition for a representa- 
tion of R,,(t) of the form (21), where F,,(x) is of bounded variation in (—z, z), is 


[ | A) | dx < const. 


for all A > 1. Moreover, a necessary and sufficient condition for a representa- 
tion of the form (21) with a real and never decreasing F,,(x) is that g,,(x, A) 
should be real and non-negative for all real x and all A > 1. These conditions 
are, of course, completely analogous to those given above for the case of a con- 
tinuous variable ¢ and a representation of the form (5), and all the rest of the 
proof runs on quite similar lines. 

In order to obtain, for the case of discrete processes, analogues of our theorems 
2, 3 and 4, we only have to replace everywhere the infinite Fourier-Stieltjes 
integral (5) by the corresponding finite integral (21). In theorems 3 and 4, 
the interval (0, ©) is then replaced by (0, 7). Otherwise everything remains 
unchanged. 
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I. Inrropucrion’ 


Given a geometrical cell-complex K and a polyhedron Y, let us consider all 
continuous mappings f(K) C Y. These maps are divided into homotopy 
classes, the maps in any class being homotopic to each other. The problem of 
determining these classes by means of known invariants (for instance, homology 
properties) of K and Y is of extremely great importance in modern topology. 

The discussion of the case when K = S'‘ is an 7-dimensional spherical manifold 
led Hurewicz’ to the definition of the 7** homotopy group ;(Y) of Y. Although 
our knowledge of these groups is still very imperfect, they have proved to be a 
powerful instrument in all considerations connected with the general question. 

Special interest has always been paid by topologists to the case when Y = S". 
In this direction there is the fundamental result of Hopf solving the question 
completely if K = K".. As newly shown by Whitney‘ this theorem may be 
stated as follows: 

The classes of maps of K" into S" are in a (1 — 1)-correspondence with the 
elements of the n** cohomology group "H,(K") of K” with the group I of all integers as 
coefficient group. 

The theorem holds even if we replace the condition Y = S" by the condition 
m(Y) = Ofori < n(n > 1), provided the group J is replaced by 7,(Y). This 
shows that the appearance of the group J in Hopf’s theorem (as stated above) is 
due to the fact that I and 2,(S") are isomorphic. 

The use of cohomology in Hopf’s theorem is natural, also, for the following 
reason. The theorem and the proofs hold for infinite, locally finite complexes 
as well as for finite, provided infinite chains and cocycles are admitted.’ The 
statement of Hopf’s theorem for infinite complexes in the language of homology 
seems to be much more complicated.° 


‘ Some of the results were published by the author without proofs, in C. R. Paris 208 
(1939), p.68. See also H. Freudenthal, Proc. Akad. Amsterdam 42(1939), p. 139. 

*W. Hurewicz, Proc. Akad. Amsterdam 38(1935), pp. 112-115. 
. * K" stands for the closed subcomplex of K consisting of all its cells of dimension Sn; 

‘H. Whitney, Duke Math. Jour. 3(1937), p. 51. References to Hopf, Hurewicz, and 
others will be found there. 

°C. H. Dowker, Proc. Nat. Acad. U. S. A. 23(1937), p. 293. 

‘In a note published in C. R. Paris 206(1938), p. 1436, L. Pontrjagin signalizes some 
results concerning the case K = K+1, Y = 8S". (n + 1)-chains with coefficients from 
™ a(S") are implicitly introduced. 
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In a recent paper’ I have made systematic use of chains with elements of 
homotopy groups as coefficients. Let Q’ be a geometrical cell-complex which is 
an r-dimensional oriented combinatorial manifold, and let P* be an 7-dimensional 
closed subcomplex of Q’. Given a map f(Q’ — P') C 8", I define an i-chain 
y'(f) in P* with coefficients in as follows: Let be an i-cell of and 
s’ "a “small” (r — i — 1)-sphere contained in Q’ — P' and “simply linked” 
with o', then the element of 2,—~;-1(S") defined by the map f(s” "") C S" is the 
coefficient of cin y'(f). I proved that this chain is a cycle and that its homo- 
logy properties are closely related with the homotopy properties of f. 

The purpose of the present paper is to build up an analogous theory for 
arbitrary complexes. In parts III, IV, and V we develop such a theory which, 
applied to manifolds, will at once give us, in part VI, all previous results con- 
cerning y‘(f), even in a slightly stronger form. 

The basic definition is introduced in part III as follows: Let K be an arbitrary 
geometrical cell-complex and f(K") C’Y a continuous mapping. Let o”™’ bea 
(n + 1)-cell of K and c(f, o”**) the element of ,(Y) defined by considering f on 
the boundary of o"*’. Taking c(f, o”**) as the coefficient of o”**, we obtain an 
(n + 1)-chain c""’(f). It is proved that c"*'(f) is a cocycle. Extension theo- 
rem I, which is the main theorem of this paper, shows how closely the cohomology 
properties of this cocycle are connected with the extension-possibilities of f. 
In all of part III Y can be an arbitrary topological space which is simple in 
dimension n, a condition introduced in part II that is necessary in order to make 
the definition of unique. 

Part IV contains the application to the case when 7;(Y) = Ofori <n. A 
generalization of Hopf’s theorem is given which includes the generalization of 
Hurewicz-Whitney* and a generalization given by the author® arising from 
replacing the hypothesis K = K” by some hypothesis concerning cohomology 
groups for dimensions > n. 

A homology interpretation of the results of part IV is given in part V, under 
some additional hypothesis on Y. 

Two appendices discussing special topics are given at the end of the paper. 


II. PRELIMINARIES 


1. Let K be a geometrical locally finite complex, with oriented convex cells 
a; of dimension n = 0,1, ---. Their number may be infinite and their dimen- 
sions may form an unbounded sequence. The cells are open, and the closure of 
the n-cell «7? will be denoted by ¢?. ; 

Let ;; = 1, —1 or 0 according as oj” is positively, negatively, or not at all, 
on the boundary ¢? — a? of o?. The boundary and coboundary of a are 
defined by 


7 Fund. Math. 31(1938), pp. 179-200. 
5 Compositio Math. 6(1939), p. 429. 
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An n-chain is an infinite linear form A" = >>; ao? , in which a; are elements 
of an abelian group G’. The boundary and coboundary of A“ are defined by 


2. Let K’ be a closed subcomplex of K. An n-chain A” = )°; ajo? is con- 
tained in K’ or in K — K’ (notation: A" C K’ or A" C K — K’) if a; = 0 for 
each n-cell o7 in K — K’ or in K’. Clearly A" C K’ implies aA” C K’ and 
A" CK — K’ implies 5A" C K — K’. 

A" is a cycle mod K’ if dA” C K’; A" is a cocycle in K — K’ if A" C 
K — K' and 6A" = 0. Two cycles Ag and A; mod K’ are homologous mod K’ 
(notation: Ag ~ Az mod K’) if there is an (n + 1)-chain A”** such that 
gA"*'— A? — Aj CK. Twococycles Ap and A? in K — K’ are cohomologous 
inK — K’' (notation: Aj — Af in K — K’) if there is an (n — 1)-chain A" * C 
K — K’ such that 6A” * = Ao — Af. 

Using the relations 3A" = 0 = 66A” we may define as usual the homology 
and cohomology groups 


_"H°(K) mod "H.(K — K’), 


where G is an arbitrary abelian group whose elements are taken as coefficients 
in the chains. 

If K’ = 0 we write Aj ~ Aj, Ad ~ Af and "H%(K) instead of Aj ~ 
Ai mod K’, Aj Af in K — K’ and "H*(K) mod K’. 

Everything can be repeated starting from finite chains. We shall use similar 
notations, replacing ~, — and H by ~*, —* and H%*. 


3. Lemma. Given an n-chain A”, an (n + 1)-cell o”™* and an n-cell o® C 
a""', there is an (n — 1)-chain A” such that 
(3.1) 6A"? = ao” + AT, AI CK aeG. 

Proor. Let o”, of , 02 ,-+-,07 be all the n-cells of ¢"** and let 

A" = Bo" ++ Bio? +B", CK 
i=1 
The boundary ¢"*? — o¢”"*! of o”** being an n-dimensional manifold there is” 
an (n — 1)-chain A?’ with integer coefficients such that 
= o} — eo” + BI, = +1, Bi CK 


Writing A"* = = 6 + and A? = B+ 
we obtain (3.1). 


; More exactly, an n-chain is a function with n-cells as arguments and elements of G as 
values, 


'’ H. Whitney, Duke Math. Jour. 3(1937), p. 44. 
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1. Let K be a geometrical locally finite complex, with oriented convex cells 
o; of dimension n = 0,1, --- . Their number may be infinite and their dimen- 
sions may form an unbounded sequence. The cells are open, and the closure of 
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7 Fund. Math. 31(1938), pp. 179-200. 
8’ Compositio Math. 6(1939), p. 429. 
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An n-chain is an infinite linear form A" = >>; ao? , in which a; are elements 
of an abelian group G’. The boundary and coboundary of A" are defined by 


0A” = 6A" = ado? 


2. Let K’ be a closed subcomplex of K. An n-chain A” = >; ajo? is con- 
tained in K’ or in K — K’ (notation: A” C K’ or A" CK — K’) if a; = 0 for 
each n-cell of in K — K’ or in K’. Clearly A" C K’ implies aA" C K’ and 
A" CK — K’ implies 6A" C K — K’. 

A" is a cycle mod K’ if dA” C K’; A” is a cocycle in K — K’ if A" C 
K — K' and 6A" = 0. Two cycles Ag and A; mod K’ are homologous mod K’ 
(notation: A} ~ Af mod K’) if there is an (n + 1)-chain A” such that 
— At — Ao CK. Two cocycles Aj and Af in K — K’ are cohomologous 
in K — K’' (notation: Aj — Af in K — K’) if there is an (n — 1)-chain A" ' C 
K — K' such that 6A” * = Ag — Af. 

Using the relations aA" = 0 = 66A” we may define as usual the homology 
and cohomology groups 


"H°(K) mod K’, "H.(K — K’), 


where G is an arbitrary abelian group whose elements are taken as coefficients 
in the chains. 

If K’ = 0 we write Aj} ~ Af, Aj — Af and "H*%(K) instead of A} ~ 
Af mod K’, Aj — Aj in K — K’ and "H°(K) mod K’. 

Everything can be repeated starting from finite chains. We shall use similar 
notations, replacing ~, — and H by ~*, —* and H%. 


3. Lemma. Given an n-chain A", an (n + 1)-cell o”** and an n-cell o” C 
s"”, there is an (n — 1)-chain A” such that 
(3.1) <A* —6A”™' = ac" + Af, aeG. 

Proor. Let 6”, of , 02, ---, be all the n-cells of and let 

= Bo" + B07 + B’, K 
i=1 
The boundary ¢"** — ¢”*? of o”** being an n-dimensional manifold there is” 
an (n — 1)-chain A?~* with integer coefficients such that 
6A? = — go" + B?, «= +l, 

Writing = a=68+ 6B; and Af = B + Dias 8: B? 


we obtain (3.1). 


* More exactly, an n-chain is a function with n-cells as arguments and elements of G as 
values. 


‘* H. Whitney, Duke Math. Jour. 3(1937), p. 44. 
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4. Let X and Y be two topological spaces and Y~ the family of all continuous 
transformations f such that f(X) C Y. Given fe Y* and A C X, we denote 
by f| A the “partial’’ function obtained considering f only on A. Y*(A, f) 
will be the subfamily of Y~* containing all the functions f’ such that f’ | A = f | A, 

Let us fix a point yo ¢ Y. We shall denote by 0 the function mapping the 
whole of X into yo , 0 will also stand for 0 | A and Y*(A, yo) for Y*(A, 0). 

We shall denote the closed interval (0, 1) by E and the Cartesian product of 
X and E by X X E. 

Two functions fo , fi € Y*(A, f) will be called homotopic relative to A (notation: 
fo ~f, vel. A) if there is a g e Y**” such that 


g(a, 7) =fi(x) for = 0,1, 

g(x, t) = f(x) for reA,teE. 
If A = 0, the functions fo and f; will merely be called homotopic (notation: 
fo ~ fi) instead of homotopic rel. 0. 


In this way the family Y~ is divided into homotopy classes and Y*(A, f) into 
homotopy classes rel. A. 


5. It is well known” that the set 7 = K X 0 + K’ X Eis a retract” of the 
product K X E, where K’ is a closed subcomplex of K, and therefore 
(5.1) Every f Y” has an extension f' « Y**". 

Let A C K’ and fe Y*. 
(5.2) Given foe Y*(A, f) such that fo|K’ ~ f|K’ rel. A, there is an 


fo « Y*(K’, f) such that fo ~ fo rel. A. 
Proor. Let g « Y* *” be such that 


g(x, 0) fo(x), g(x, 1) f(x) for x 
g(x, t) = f(z) 


Writing g'(x, t) = g(a, t) for (x, t) eK’ X E, 
g'(x, t) = fo(x) for (27, XO 


we have g’ « Y’ and by (5.1) there is an extension g” « Y**" of g’. The fune- 
tion fo(x) = g’’(z, 1) satisfies the conditions of (5.2). 
Taking A = 0 in (5.2) we have 
(5.3) Given fo, fi ¢ Y* such that fy| K’ ~f, | K’, there is an fo « Y* such that 
fo ~ fo and fo | K’ = fi | K’. 


6. Let S" be an oriented n-dimensional sphere, let S" = E? + E” be a 
decomposition of S” into two hemispheres (oriented as S"), and let 2 be a 
point of the equator = . 


1 See P. Alexandroff and H. Hopf, Topologie J, Berlin 1935, p. 501; K. Borsuk, Ann. Soc. 
Polon. Math. 16(1937), p. 218. 
2A CX isaretract of X if there isanr AX such that r(x) = zforz «A. 
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We shall denote by [f] the homotopy class rel. zo of a map f « Y*"(xo , yo). 
Given fo , fr € Y°"(ao , yo), we obviously have 


fo|E” ~ Orel. and ~ Orel. 
Therefore, by (5.2), there are two functions fo €[fol and fi € [fi] such that 
fo|E2 =0 and fi| Et =0. 
Writing 
f| Et = fol Et and = fi| E* 
we have f e Y°"(ao, yo). We define 
([f] = [fo] + [fil. 


It is easy to see that with this definition the homotopy classes rel. 2 of 
Y"(z», yo) form a group t = mn(Y) which is called” the n** homotopy group 
of Y (with respect to yo). The unit element of this group is obviously the 
class [0]. 

(6.1) is abelian for n > 1." 


7. We shall call Y simple in dimension n,” or, more briefly, n-simple if every 
homotopy class of Y*" contains exactly one homotopy class rel. x of Y*"(2o , yo). 
In other words, Y is n-simple if 

(a) for each f there is an f’ Y°"(ao , yo) such that f’ ~ f 

(b) fos fie Y°"(xo, yo) and fo ~ f, imply fo = fi rel. x0. 

It can easily be shown, using (5.3), that (a) is equivalent with 

(a’) Y is arcwise connected. 

From now on we shall assume that Y is arewise connected. 

Obviously if Y is n-simple then every f « Y*" determines uniquely an element 
of x, , and we may suppose that the elements of z, are the homotopy classes of 
y*. This is the property that makes n-simple spaces useful for our further 
discussion. 

(7.1) mis abelian if and only if Y is 1-simple.”” 
(7.2) If m, = 0 then Y is n-simple.” 

(7.3) If m, = 0 then Y is n-simple.” 

(7.4) is n-simple for r = 1, 2,---. 


8. Let E” be an oriented n-dimensional element bounded by 8". Let us 
choose two homeomorphisms 


= E” and h(E") = E” 


“ W. Hurewicz, Proc. Akad. Amsterdam 38(1935), p. 113. 
“ [bid., p. 114. Though m is in general non-abelian, it is abelian in all the cases con- 
sidered here, so that additive notations will be used throughout this paper. 
S. Eilenberg, Fund. Math. 32(1939), pp. 167-175. 
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such that h, = h_| = being the equator of S". Let 
us agree further that h, transforms E} into E” positively. Then h_ transforms 
E* into E” negatively. 

Given two functions fo , Y*" such that fo = fi | let (fo, fi) 
be a function defined by 


(fo, fi) | Bt = fohs, (fo, fi) | = frh_. 


Assuming that Y is n-simple we denote by d(fo , fi) the element of 7,, corre- 
sponding to (fo, fi). 
(8.1) d(fo, fi) = 0 af and only if fo ~ fi rel. S”™”. 
(8.2) If fo, fi,feeY™ and fo| S”* = fi| = fa| then d(fo, fi) + 
d(fi, fo) = d(fo, fr), 
(8.3) d(fo, fi) = —d(fi, fo). 


(8.4) Given foe Y™ and wer, there is anf, such that fo| S” = f,| 8" 
and that d(fo, fi) = a. 
Proors. Ap (8.1). d(fo, fi) = 0 is equivalent to (fo , fi) ~ 0 and this ob- 
viously holds if and only if fo ~ f, rel. S”™. 
Ap (8.2). Since f; ~ 0 there is a map g; « Y""*” such that 


g(z,0) =filz), glz,1) = y for re E" 
using (5.1) we can find two maps go , gz € Y*"*” such that 
gi(a,0) = fix) for 4 = Q, 2, 
g(x, t) = g(a, t) for (2, t)eS"" 4 = 0, 2. 
Writing fi,.(x) = gi(a, t) for te H andi = 0, 1, 2 we have for 7, 7 = 0, 1, 2 
fia |S" and (fi, fd) ~ (Sit, 


In particular we have d(f;, f;) = d(fia, fj) and since fi; = 0, (8.2) reduces 
to the formula 


d(for, 0) + for) = , fe), 


which is a direct consequence of the definition of 7, . 

Ap (8.3). Follows from (8.1) and (8.2) taking fe = fo. 

Ap (8.4). Let g«Y*" be a map representing the clement aez,. Since 
g | Et ~ 0 and fo ~ 0 we have 


g | Et fohy. 
Using (5.3) we find ag’ « Y*” such that 
=~g and g’| Et = foh,. 


Writing = we have f, Y*", fo | =f, | andg’ = (fo, fi). There 
fore d(fo, fi) = a. 


| 
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III. THe GenerAL THEORY 


9. We assume that Y is n-simple. All chains considered here will have coeffi- 
cients taken from a, = 7,(Y). 

Let fe For each (n + 1)-cell the map f|(o"')’, the n-sphere 
(o""') = — o”™ being oriented by the n-cycle defines uniquely an 
clement of x, which we denote by c(f, o”™). We define an (n + 1)-chain 
(f) writing 
(9.1) ef) = of 


n+l 


Obviously c(f, ¢"”) = 0 is equivalent with f | (c""’)’ ~ 0 and therefore 


(9.2) c(f, = 0 of and only Gf there is an extension f’ of f. 


(93) Iff'eY*™ andf (f) = 

Let fo, fie fo| =f,|K”™. For each n-cell o” the maps fo | and 
fi|@" define according to 8 an element d(fo, fi, 0”) of a. We define” an 
n-chain d"(fo , fi) writing 


(9.4) a"(fo, fi) = fr, of o?. 
It follows from (8.1) that 
(9.5) d(fo,fi,o") 0 af and only if fo |e" ~fi | rel." — o". 


10. We shall prove the following fundamental properties of c”*'(f) and 
d"(fo, fi): 
(10.1) is a cocycle (i.e. (f) = 0). 
(10.2) éd"(fo, fr) = (fo) — 
(10.3) If fo, fri, feeY™ and fo|K™* = = then 
d"(fo Ji) + a"(fi fo) d"(fo 
(10.4) d"(fo, fi) = —d"(fi, fo) 
(10.5) Given fo « Y*" and an n-chain d” there is an f, « Y*" such that fo | K"* = 
fi| and that d"(fo , f:) = a”. 
Proors. (10.3) follows from (8.2); (10.4) follows from (8.3) and (10.5) is an 
immediate consequence of (8.4). 
Ap (10.2). Let of be an n-cell. We suppose first that 


(10.6) fo| K” of =fi|K" —o. 
We then have 
a" (fo = fi, , 


and therefore 
n+ 


bd" (fo fi) = fr, of do? = d(fo, fi, oF) 
According to footnote? we obtain c"*1(f) by considering c(f, as a function of the 
‘’ See the paper of L. Pontrjagin quoted in °. 
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If a7;** = 0, then because of (10.6) we have fo | (o7"")’ = fa| (of). It follows 
that c(fo , = c(f, , **), and (10.7) holds. 

If 0 we have of Let h(@?) = (07 — of be a homeomor- 
phism such h(x) = a for x € (o7'). Because of (10.6) we have 


(10.8) foh = fih. 
It follows from 8 that for k = 0, 1 we have 
= d(fe, Sah, of) if = 1, 
c(fe, = d(fuh, fe, o7) if = —-1. 
Using (10.8) and (8.3) we obtain 
e(fo, — = [d(fo, foh, + d(foh, fi, o7)]. 


(10.7) follows, from (8.2). 
Now, let of, o2 , ---,0; be the n-cells o , where oj is an arbitrary 
(n + 1)-cell. We the functions go, 91, , gr Y*" as follows: 


go = fo, 
gi | K" — of = gin|K" — of =f, | fori > 0. 
We then have fori = 0,1,---,r—-1 


gt ti 1 
fé n+ + 


bd"(9i = (gi) — 
and by (10.3) 
(10.9) bd"(fo , gr) = — c”*(gr). 
But since g, | (o?*') = fo| (o? we have 
=Afo,fi,o7) for of 


Consequently we deduce from (10.9) that ¢(fo, 07’) — c(fi, is the coeffi- 
cient of o}** in the (n + 1)-chain 6d"(fo, fr), and hence (10.2) is completely 
proved. 

Ap (10.1). Let o”** be an (n + 2)-cell and o”*! C 6"* an (n + 1)-cell. 
By the lemma of 3 there is an n-chain d” such that 


(10.10) c""(f) +A" AM aem. 


According to (10.5) there is an f’ such that f| = f’| and 
d"(f, f’) = d". Therefore, by (10.2), 


(10.11) ef) — 6d” = 


| 
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It follows from (10.10) and (10.11) that c(f’, = and that c(f’, =0 
for each (n + 1)-cell of? Cae"** — 0". By (9.2) we may therefore suppose 
that f’ is extended on (o"**) —o""'. This complex being an (n + 1)-element it 
follows f’ | (o"*")’ ~ 0 therefore c(f’, «""") = 0 and therefore a = 0. This and 
(10.10) imply 


and therefore 
= — CK — 
o"” being an arbitrary (n + 2)-cell it follows that éc""'(f) = 0. 


11. Let f e Y* **", where K’ is a fixed closed subcomplex of K. 
(11.1) CK — K’. 
(11.2) c"(f) = 0 of and only if there is an extension f' « Y*'**"™" of f. 
In fact, for each o”*' C K’ we have f|@"*' ~ 0, whence c(f, o"*') = 0. 
Therefore (11.1) holds. (11.2) is a consequence of (9.2). 
Let fo, fie fo| KK’ + =f,|K' + 
(11.38) d"(fo, fi) CK — KR’. 
(11.4) d"(fo, fr) = 0 of and only if fo ~ fy rel. K’ + K"™. 
For each o” C K’ we have fo|¢" = fi | ¢" and from (9.5) we deduce 
d(fo, fi, 7") = 0. Therefore (11.3) holds. (11.4) follows from (9.5). 
(11.5) Given fo « Y*'**" and an n-chain d” CK — K’, there is an f, « Y*'**" 
such that fo| K’ + K” = f,| K’ + K"™ and that d"(fo, fi) = a". 
(11.6) Given f Y*'**" and a cocycle C K — K' such that 
in K — K’, there is anf’ « Y*'**" such thatf | K'+ K"'=f'|K+K"" 
and that c"**(f’) = 
Proors. (11.5) is an immediate consequence of (8.4). In order to prove 
(11.6) let us consider an n-chain d" C K — K’ such that 


By (11.5) there is an f’ e Y*’t*" such that f | K’ + K"" = f’| K’ + K"” and 
that d"(f, f’) = d". According to (10.2) we then have 


and c"*"(f’) = follows. 


12. Exrension I. Let fe The (n + 1)-chain de- 
jined by (9.1) is a cocycle in K — K’. Moreover, c"*'(f) —0 in K — K’ if and 
only if there is an f’ Y*'**"*" such that f| K’ + = f’|K’ + K"". 

PRoor. The first part of the theorem follows from (10.1) and (11.1). If 
K — K’ then applying (11.6) for c"*? = 0 we find an f” « 
such that f | K’ + = f’| K’ + and that c"*'(f”) = 0. By (11.2) 
there is an extension f’ ¢« Y*’**"*" of f”. We then have f|K’ + K"* = 
f’|K + K"*. On the other hand, if such an f’ exists we have: 6d"(f, f’) = 


or- 
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— because of (10.2), = 0 because of (11.2), and 
d"(f, f’) K — because of (11.3). It follows that c"*"(f) —0in K — K’. 

Homotopy Turorem I. Given two functions fo, fie such that 
fo | kK’ = hh | K’ the n-chain ad" (fo fi) defined by (9.4) is a 
cocycle in K — K'. Moreover, d"(fo, fi) — 0 in K — K’ af and only if 
(12.1) fo| K’ + K" | K’ + K" rel. K’ + K"”. 

Proor. We shall consider the interval F as a complex containing two 0-cells 
0 and 1 and one 1-cell ¢ oriented so that deg = 0 — 1. The product L = K x E 
is considered as a complex with cells of the form o” X 0, o” * 1 and o" X «. 
The n-cell c” X 0 is oriented as o”, 0” X las —o". The (n + 1)-cell o” X «¢ 
is oriented so as to have X = Xe+o” X de. 

Let LD’ = K’XE+KX0+KX1. Letge Y”*” be the map defined by 


g(a, t) = for weK’'+K"", 
g(z,t) = for xreK, = 6, 1. 


By (9.2) we then have c(g, o”*’ X 0) = 0 and cg, «”** X 1) = 0. It follows 
from 8 that c(g, o” X «) = d(fo, fi, 0"). Therefore we have 


= d"(fo, fi) X 


The condition d"(fo , fi) — 0 in K — K’ is therefore equivalent with the condi- 
tion c""'(g) —O0inL — L’. By Ext. th. I this is equivalent with the existence 
of such that g| L’ =g'| L’+L"". This, however, means 
exactly (12.1). 

Homorory Ia. Jf "H,,(K — K’) = Oand fo, fi Y* then 


(12.2) fo| K'+ ~f,| + rel. A 
implies 
(12.3) fo| K’ + K" ~f, | K’ + K” rel. A 


for any subset A of K. 

Proor. By (5.2) and (12.2) there is an fo « Y* such that fo | K’ + K"' = 
fi| + K"" and that fo~forel. A. Since d"(fo , fi) in K — K’ it follows 
from Hom. th. I that fo | K’ + K” ~fi| K’ + K" rel. A. This implies (12.3). 


13. In this section Y is 7-simple for 7 = n,n + 1, --- , dim (K — K’). 
Extension II. Jf — K’) = 0 fori =n,n+1,-::, 
where = then every f « which has an extension f’ « Y*'**" has also 
an extension « Y*. 
Proor. We define a sequence fn = f’, fn42, Of maps 
such that fiz: | K’ + =f; | K’'+ The sequence exists by Ext. th. I. 
Writing f’’(x) = lim f;(x) we have f” « Y* and f| K’ = f’| K’ = f’ | K. 
Homotopy Turorem II. Jf ‘H,,(K — K’) = Ofori =n,n +1, ---, where 
mw; = mi(Y), then fo, fie Y* and fo| K’ + K”' ~f,| K’ + K"" rel. A imply 
fo = fi rel. A for any subset A of K’. 
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Proor. LetL = K XE, L’ = K’ XE+KX0+K We then 
have '"H,(L — L’) = 0 fort = n,n + 1,---. It follows by Ext. th. II 
that every ge Y” which has an extension g’¢ YY” *”” also has an extension 
” rl 


If fo| K’ + ~fi| K’ + rel. A, there is ag’ such that 
t) = for wreA, te E, 
g'(x,t) = fix) for i=0,1. 


Writing g’ | L = g, we see that there is an extension g” ¢ Y” of g. This implies 
fo~/hirel. A. 


IV. Tue Case 7;(Y) = 0 ror i < n. 


14. In this part we assume that 7;,(Y) = 0 fori <n. If n > 1 this implies 
m = Oand by (7.3) Y is7-simple for allz. Ifn = 1 we assume that Y is 1-simple, 
or (see (7.1)) that a is abelian. In particular everything can be applied for 
Y = 

(14.1) Every f Y*' has an extension f’ Y*'**", 
(14.2) fo, fie and fo|K’ ~ fi|K’ rel. A imply fo| KK’ + K"* ~ 
fi| K’ + K"” rel. A for any A CK’. 

Proors. Obviously there is an extension fo € of f. Since Y is arewise 
connected (see 7) there is also an extension f,; « Y*’**' of f. Since x; = 0 for 
i <n, we have “'H,,(K — K’) = 0 and applying Ext. th. I we obtain suc- 
cessively extensions fix, for i = 1, 2,---,n — 1. 

Y being arewise connected, it follows from fo| K’ ~ fi| K’ rel. A that 
fo| K'+K°~fi| K’+ K’ rel. A. Since ‘H,,(K — K’) = Ofori = 1,2,---, 
n — 1, we obtain fo| K’ + KK” ~f,| K’ + K”™ rel. A applying Hom. th. I a. 


15. 
(15.1) fo, fre and fo| K’ ~f, | K’ imply ~ (fi) in K — K’. 

Proor. By (14.2) we have fo| K’ + K”’ ~f, | K’ + K””, and according 
to (5.2) there is an foe Y*'**" such that fo ~ fo and fo|K’ + K"" = 
fi|K’ + K"”. From (9.3), (10.2), and (11.3) we then have: c"*"(fo) = 
6d" (fo fr) = (fo) — and d"(fo , fi) C K" — K’. It follows 
that (fo) (f,) in K K’. 

By (14.1) every f Y*’ has an extension f’« Y*'**". Let be the 
element of "*'H,,(K — K’) determined by the cocycle c"™'(f’). It follows from 
(15.1) that the choice of f’ does not matter and that 


(15.2) fo, fie and fo ~ fi imply c”*"(fo) = (fi). 


From Ext. th. I we obtain the following 
Exrension TuroreM III. Given we have = 0 if and only if 
there is an extension f’ ¢ Y*'**"™ of f. 
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16. 
(16.1) Given fo,fisfo, such that fo| K’ + K”'=fi| K'+K"', ff | 
K’'+K"" = fi| K’ + K"", fo = forel. K’ and fy ~ fi rel. K’ we have 
d" (fo d"(fo in K — K’. 
Proor. Let L= K Letge 
be defined by 


g(x, t) =folx) for xe K’ + te E, 
g(x, t) = fi(x) for 0,1. 


In an analogous way we define g’ « Y“'*"" using fo and f; instead of fo and f,. 
As in 12 we then have 
(16.2) eg) Xe =d(fo, fi) X«. 

Now, fo ~ fo rel. K’ and f, ~ fi rel. K’ obviously imply g | L’ ~ g’ | L’, and 
therefore, by (15.1), e"*'(g) —c""'(g’) in L — L’. Using (16.2) we then obtain 
d"(fo , fi) —a"(fo, fi)in K — K’. 

Given two functions fo, fi, such that fo | K’ = fi | K’ we have, by 
(14.2), fo| K’ + K"’ ~ fi| K’ + K"" rel. K’, and by (5.2) there are two 
functions fo , fi Y*, such that 


(16.3) fo~forel. K’, fi~firel.K’, fo| K’ +K"'=fi|K’+K"". 


Let d"(fo, fi) be the element of "H,,(K — K’) determined by the cocycle 
d"(fo, fi). It follows from (16.1) that d”(fo, f:) is independent of the particular 
choice of fo and f; such that (16.3) holds. By (10.3) and (10.4) we have 


Tf fo, fi, foe and fo | K’ = fi | K’ = fo | K’ then 
a"(fo, fir) + d"(fi, fo) = d"(fo, fr), 
(16.5) d"(fo fi) = —d"(f, ’ fo). 

Homotopy III. (Given fy, fi Y* such that fy | K’ = fi | K’ we have 
a" (fo , fi) = 0 af and only if 
(16.6) fo| K’ + K” ~fi| K’ + K" rel. K’. 

Proor. If d"(fo, fi) = 0 then d"(fo, fi) — 0 in K — K’ and by Hom. th. I 
we have fo | K’ + K" ~ fi | K’ + K" rel. K’. Using (16.3) we obtain (16.6). 
On the other hand, by (5.2), (16.6) implies the existence of an fo « Y* such that 


fo fo rel. K’ and that fo | K’ + K" = f,| K’ + K". It follows d"(fo, fi) = 0 
and therefore d"(fo , fi) = 0. 


(16.4) 


17. In this section (and in 18) we assume as before that 2;(Y) = 0 fori < n. 
If nm = 1 we assume that Y is 7-simple for i = 1, 2, -.. , dim (K — K’). As 
before, everything can be applied for Y = S". 

Combining Ext. th. II and III we have the 

Extension THEoREMIV. Let — K’) =0 
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where = wi(Y). Given f we have c"*'(f) = 0 if and only af there is an 
extension f’ Y* of f. 

Similarly Hom. th. II and III imply 

Homorory THEoREMIV. Let ‘H,,(K — K’) =0 fori=n+1,n+4+2,--.-, 
where = Given fo , fi such that fo | K’ = fy | K’ we have d"(fo , fr) = 
0 if and only if fo ~ fi rel. K’. 

The hypotheses of Ext. th. IV and of Hom. th. IV are obviously satisfied if 

Existence THEorEM. Let — K’) = 0 fori=n+1,n+2,---, 
where = Given fre Y* and d" e«"H,,(K — K’) there is an fye 
such that fo | K’ = hi | and that d"(fo d”. 

Proor. Let d" be a cocycle of the cohomology class d". By (11.5) and 
(10.4) there is a map fo Y*'**" such that fo| K’ + = f,| K’ + 
and that d"(fo, fi) = d". By (10.2) we have 6d" = (fo) — c"*(f,). Since 
bd" = Oandc"*"(f,) = 0 (see (11.2)) it follows c"*"(fo) = 0 and by (11.2) there is 
an extension fo of fo. 

Now, let K” = K’+ K". We then have 'H,,(K — K”) = Ofori =n+1, 
n+ 2, --- and by Ext. th. IT there is an extension fy ¢ Y* of fy. We then have 
d"(fo = d" and therefore d"(f,’, fi) = d". 


18. Let f*e Y*. Asin 4 Y*(K’, f*) will be the sub-family of Y“ containing 
all fe Y* such that f | K’ = f*|K’. The family Y“(K’, f*) is divided into 
homotopy classes rel. K’, two maps fo , f: , « Y“(K, f*) being in the same class if, 
and only if, fo ~ fi rel. K’. 

Given a homotopy class® rel. K’ of Y“(K’, f*) we define d"() = d"(f, f*) 
where f e®. It follows from (16.4), (16.5), and Hom. th. III, that the element 
d"(6) of "H,,(K — K’) is defined uniquely. Under the hypothesis of Hom. 
th. IV we have @) = %, if and only if d"(4)) = d"(#,)._ Under the hypothesis 
of the Existence th. there is for each d" «"H,,(K — K’) a homotopy class ® 
rel. K’ of Y“(K’, f*) such that d"(®) = d". We obtain, therefore, 

CiassiFicaTION THEOREM I. Let ‘H,,(K — K’) = ‘“H,,(K — K’) = 0 
fori=n+1,n+ 2, .--, where = The elements of "H,,(K — K’) 
are ina (1 — 1)-correspondence with the homotopy classes rel. K’ of Y“(K’, f*). 
The correspondence is determined by the operation d"(f, f*). 

Taking K’ = 0 and f* = 0 we obtain 

CLassiFIcATION THEOREM II. Let 'H,,(K) = "'H,,(K) = 0 fori=n+1, 
n+ 2,--- where x; = 4(Y). The elements of "H,,(K) are in a (1 — 1)-corre- 
spondence with the homotopy classes of Y“. The correspondence is determined by 
the operation d"(f)(=d"(f, 0)). 

Note that according to 16 d"(f) is defined as follows: by (14.2) we have 
f|K"~* ~ 0, therefore by (5.2) there is an f’ « Y* such that f ~ f’ and that 
f'|K"*=0. d"(f) is then the element of "H,,(K) corresponding to the cocycle 
d"(f’) d"(f’, 0). 

The hypothesis of Class. th. I are obviously satisfied if K — K’ CK". Tak- 
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ing K = K” in Class. th. II we obtain the theorem of Whitney quoted in the 
introduction. 


V. APPLICATION TO HoMOLOGY 


19. Let G, H, Z be three abelian groups. If to each aeG and 8 eH there 
corresponds and a-8 in Z, and both distributive laws are satisfied, we say" 
G and H form a group-pair with respect to Z. If J is the group of rational 
integers then clearly G and J form a group pair with respect to G. 

Given a finite n-chain A” = }>; a,;o? with a; ¢@ and an arbitrary n-chain 
= di with B; e H we write 


A".B" = a:-B;. 
It is easy to see that 
aA**.B* 


for every finite (n + 1)-chain A”**. It follows that if A? and Af are finite 
cycles mod K’ (coef. @) and Bg and B7 are cocycles in K — K’ (coef. H) then 


Aj ~* Aj and By 


imply 
Ao-Bo = 


We see then that "H**(K) mod K’ and "Hy(K — K’) form a group pair with 
respect to Z. Similar relations hold for "H“(K) mod K’ and "H;(K — K’). 


20. We assume that 

1°) Y is locally connected in dimensions < n** 

2°) r(Y) = Ofori<n 

3°) is isomorphic with I. 

It follows from (7.3) and (7.1) that Y is n-simple. 

It follows from our hypothesis that z,(Y) can be considered as identical with 
the n‘" homology group “K"(Y) with integer coefficients.” Further, there is in 
Y an n-dimensional cycle Tg (coef. J) such that for every n-dimensional cycle 
I" (coef. G) in Y there is a unique a eG such that I” ~ aT%." Owing to this 
fact we may write "(°(Y) = G and in particular 7,(Y) = "K7(Y) = I. 


21. Letfe Y*’. For every finite cycle A” in K’ with coefficients in G we have 
f(A") ~ ald where aeG. 


18 See E. Cech, Ann. of Math. 37(1936), p. 684. 

19 See e.g. C. Kuratowski, Fund. Math. 24(1935), p. 269. 

20 W. Hurewicz, Proc. Akad. Amsterdam 38(1935), pp. 521-2. Explanations concerning 
homology in Y will be found there. 

*t See N. E. Steenrod, Amer. Jour. of Math. 58(1936), pp. 661-701. 
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The element a will be called the degree of f on A" and denoted by g(f, A"). It is 
obvious that Ag ~* A? in K’ implies g(f, Ao) = g(f, Ar) and therefore g(f, a") is 
defined for every a” « "H*“(K’) and is a homomorphic mapping of "H*“(K’) 
into G. Obviously fo, fie Y*’ and fo ~ fi imply g(fo, a") = g(fi, a"). 


Let f’ « Y*’**" be an extension of f. For any (n + 1)-cell o'** we then have 


according to 9 and 20 c(f’, o?*") = g(f’, d07*") which can be written 
ope" "(f’) = g(f', 
It follows that 
for any finite (n + 1)-chain A"** with coefficients in G. If, in particular, A""' 
is a finite cycle mod K’ then @A"*' C K’ and 
(21.1) A™ = g(f, 
Therefore, according to 15 and 19 
(21.2) = g(f, aa”"™") for every "'H*°(K) mod K’. 


22. Let fo, fie Y* and fo| K’ = fi| K’. For every finite cycle A" mod K’ 
(coef. G) fo(A") — f(A") is an n-cycle in Y, and therefore fo(A") — fi(A") ~ al 
forsomeaeG. Wewriteg(fo,fi,A”) =a. As before we verify easily that g(fo, 
fi, a") eG is a homomorphic map defined for every a" e"H*°(K)mod K’. Clearly 
fi, fi fo ~forel. K' and f, ~fi rel. K’ imply g(fo fi, @") = (fo a”). 

Now let us suppose that + = fi |K’ + K"". We then have 
according to 8, 9, and 20d(fo , fi , 07) = g(fo , fi , and as before 


A".d"(fo ? fi) g(fo ’ A") 
and therefore 
(22.1) A”.d"(fo, fi) = fy A”) 
for every finite cycle A" mod K’. According to 16 and 19 we then have 
(22.2) a"-d"(fo, fr) = g(fo, fr, a") for every a” «"H*°(K) mod K’. 
23. We assume now (besides the hypothesis on Y made in 20) that 
1°) K — K’ is finite, 
2°) G = R is the group of real numbers reduced mod 1. 


In this case the groups "H"(K) mod K”™ and "H,(K — K’) are orthogonal” and 
therefore every element of "H;(K — K’) can be considered as a character™ of 


* This group has to be considered as a topological compact group. See L. Pontrjagin, 
Ann. of Math. 35(1934), p. 908. 

* L. Pontrjagin, Ann. of Math. 35(1934), pp. 361-388; H. Whitney, Duke Math. Jour. 
3(1937), p. 40. 

*L. Pontrjagin, Loc. cit. 
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"H"(K) mod K’. By (21.2) and (22.2) we see that the cohomology classes 
cf) — K’) and d"(fo, fi) — K’), when considered as 
characters, are just equal to g(f, aa”"") and g(fo, fi, @").”” Using Ext. th. III 
and Hom. th. III we obtain therefore 

ExtTENSION THEOREM III*. Given f « Y“' we have g(f, aa"™') = 0 for every 
a‘! «"*1H"(K) mod K’ if and only if there is an extension f’ « Y*'**"™" of f. 

Homotopy THEorEM III*. Given fo , fi e Y“ such that fy | K’ = fy | K’' we have 
g(fo, fi, a") = 0 for every a" «"H“(K) mod K’ if and only if fy | K’ + K" ~ 
fil K’ + K" rel. K’. 


24. In order to obtain classification theorems in terms of homology we have 
to admit that 

1°) Y is locally connected in dimensions S dim (K — K’); 

2°) ri(Y) = Ofori <n; 

3°) m,(Y) is isomorphic with I. 

If n > 1 it follows from (7.3) that Y is 7-simple for all7. If nm = 1 we require 
Y to be 7-simple for 7 = 1, 2, --- dim (K — K’). In particular we may take 
Y = §”. 

The group 7; = 7:(Y) being countable at most for 7 < dim (K — K’), there 
is” a topological compact group p; orthogonal to 7;. In particular we may 
take p, = R. 

If we admit further that K — K’ is finite, then the groups ’H?‘(K) mod K”™ 
and ’H,,(K — K’) are orthogonal” and therefore the formulas 


mod K'=0, /H,,(K — K’) = 0 


are equivalent. 

Using the argument of 23 we may restate all the theorems of 17 and 18 replac- 
ing cohomology by homology. In particular we obtain 

CLASSIFICATION THEOREM I*. Let ‘H’‘(K) mod K’ = ‘*'H’‘(K) mod K’ = 0 
frni=n+1,n+2,.---. The characters of the growp "H*(K) mod K' 
are in a (1 — 1)-correspondence with the homotopy classes rel. K’ of Y*(K’, f*). 
To each f « Y“(K', f*) there corresponds the character g(f, f*, a”). 

CLAssIFICATION THEOREM II*.” Let = = 0 fori =n +1, 
n+ 2,---. The characters of the group "H"(K) are in a (1 — 1)-correspondence 
with the homotopy classes of Y“. To each f « Y* there corresponds the character 
g(f, a"). 


VI. 


25. Let Q' be a finite or infinite geometrical cell-complex which is an oriented 
r-dimensional combinatorial manifold.” The first barycentric subdivision 


%* This conclusion (and consequently also the theorems which follow) can be obtained 
even when K — K’ is infinite provided the groups ‘H®(K — K’) and ‘H7(K) mod K’ are 
orthogonal for i = 1, 2, ---. 

26 L. Pontrjagin, Ann. of Math. 35(1934), pp. 361-388. 

27 See S. Eilenberg, Compositio Math. 6(1939), p. 429. 

*8 See e.g. K. Reidemeister, Topologie der Polyeder, Leipzig 1938, p. 151. 


| 
A 
i 
| 
| 


COHOMOLOGY AND CONTINUOUS MAPPINGS 247 


Qi of Q’ is a simplicial complex, and using the simplices of Qi; we may define as 
usual the (oriented) dual (r — n)-cell for each n-cell of Q’. The dual cells form a 
cell-complex K" which is also an r-manifold and has Q; as a barycentric sub- 
division. 

If o) " is the (r — n)-cell of K” dual to the n-cell 77? of Q’ we write D(r?) = 
"and D*(o; ") = . More generally, given an n-chain. 


A" = 
we write 
D(A") = aio; 
and *{9(A")] = A“. It is well known that 
aD(A") = (—1)""D(6A").” 


Similar relations hold for 9*. We obtain thus (1 — 1)-isomorphisms 
= ""He(K’) and 9["H<(Q’)] = ""H°(K’). The inverse isomor- 
phisms are given by D*. 


26. P will stand for an arbitrary closed subcomplex of Q’. We shall denote 
by the subcomplex of K’ consisting of all the cells where ¢ P. 
(26.1) P CAP). 

(26.3) — is a closed subcomplezx of K’. 

(26.4) P = P' implies CK’ — 9(P). 

(26.5) D(A") for every n-chain A" in P. D*(A") CP for every n-chain 
A" in D(P). 

(26.6) K" — \(P) is a deformation retract” of Q’ — P.” 

(26.7) Every f has an extension f' 

(26.8) fo, fre and fo | — D(P) ~f, | — D(P) imply fo ~ fi. 


(26.1)-(26.5) follow directly from the definition. In order to prove (26.6), 
notice that K" — 9(P) consists of all simplices of the barycentric subdivision 
Q; of Q’ which have no vertex on P. (26.6) is therefore a consequence of the 
following quite general and elementary lemma: 

Let Q; be a geometric simplicial complex, P; a closed subcomplex of Q, and C(P;) 
the closed subcomplex of Q, consisting of all simplices of Q: which have no vertex on 
P;. The complex C(P;) is then a deformation retract of Q, — Pi. 

(26.7) and (26.8) follow from (26.6). 


27. We assume that Y is n-simple. All chains will have coefficients from 
= 


** See H. Seifert and W. Threfall, Lehrbuch der Topologie, Leipzig-Berlin 1934, p. 245. 

°° A CX is a deformation retract of X if there is a map g « X*** such that g(z, 0) = 2, 
g(x, 1) A fora andg(a, 1) = azforzeA. 

* See S. Lefschetz, Topology, New York 1930, p. 141. 
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Let P be a closed subcomplex of Q’ and let fe " Taking K’ = 
K’ — 9(P) we have according to (26.1) and (26.4) 


K’ + K" K’ Cc Q’ 


Write 
f| K’ + K", yf) 


(27.1) is a cycle in (with coef. in r,(Y)). 
(27.2) yor" andf imply =v” “(f’). 
(27.3) CP then there is an f’ « such that f x 
- 
(27.4) y "“(f) = 07 and only if there is an f’ « Y°”""* such that f ~ 
(27.5) Given an (r — n — 1)-cycle C P such that ~ inP, 
there is an f’« "such that y"""(f’) = and that 
Q’ - ~f' | Q’ - 
(27.6) ~ 0 in P if and only if there is an " such that 
f | Q’ - ~f' | Q’ 
Proors. Ap By (10.1) and (11.1) is a cocycle in 
Therefore, by (26.5), is a cycle in P. 
Ap (27.2). f ~f’ implies f | K’ + K"~f’|K+K". Therefore f, ~ fi, and 
by (9.3) = It follows that y~""(f) = 
Ap (27.3). implies fi) C D(Pi), whence = 0 
for every CD(P) — D(P;). (9.2) f; admits an extension on every such 


o?*' and therefore there is an extension f; ¢ ye Derr of fi. According 
(26.7) there is an extension f’ Y° of fi. We then have 
f|K -9(P"") =fi=f' | — and by (26.8) f | 

Ap (27.4). If y~""(f) = 0 then taking P; = 0 in (27.3) we obtain an 
such that f ~ f’|Q” — On the other hand if such an f’ 
exists we have y" = by (27.2) and therefore = 0 by 

Ap (27.5). Let = Diy""). Then ~ “(f) in P implies 
Diy") Dy" in D(P) and therefore fi) in K’ — K’. 
By (11.6) there is an f; Y*'**" such | K’ + =f, | K’ + and 
that = Since K’ + K" = ig there is, by (26.7), an 
extension f’ of fi. We then = De" = 
Since K’ + K"* = K’ — 9(P™") and f| K’ + K"" = f,|K + K™ = 
fi|K’+ K"" =f'|K+K"", wehavef | K’ — D(P™") = f’| K’ — 
and by (26.8) f|@" — P-" ~f’ |Q" — 

Ap (27.6). If y’~""(f) ~ 0 in P, then, taking y""* = 0 in (27.5), we obtain 
an f” such that ""(f”) = 0 and that — ~ 


* This definition of y~""“(f) is obviously equivalent with that given in the introduction. 
Cf. footnote 15, 
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f’|Q@ — Applying (27.4) we obtain an f’ such that 
f@ —-P "ef |Q’ — P”™. On the other hand, if such an f’ exists then 
f|K’ + ~f'|K’ + K"" since K’ + K"* CQ’ — By (5.2) 
we may therefore suppose that f | K’ + K"* = f’| K’+ K"". Since K’ + 
K’ CQ — P™”" and K’ + K"" CQ’ — P’"” it follows from Ext. th. I that 
| K' + K") —0inK — K’ = 9(P) and therefore that ~ Oin P. 


28. In this section we assume that 7;(Y) = 0 fori < nif n > 1, and that Y 

is 1-simple (i.e. that #(Y) is abelian) if n = 1. 

(28.1) Given f there is anf’ such that f ~ f’ | Q’ — P. 

(28.2) fo, “and fo|Q’ —-P~fi|Q' — Pimp fi~fi. 

(28.3) fo, fieY" and folQ’ P~filQ — P imply ~ 
(fi) inP. 

Proors. Ap (28.1). By (14.1) there is an f,¢« Y* **” such that f| K’ = 
fi | K’, where as before we take K’ = K’ — D(P). Since K’ + K” = K" — 
a(P’"*) there is, by (26.7), an extension f’ « Y°’-”""” of f,. We then have 
{| K’ — D(P) =f' | K’ — D(P) and therefore f ~f’ | Q’ — P by (26.8). 

Ap (28.2). fo|Q’ — P~fi|Q’ — P implies fo | K’ ~f, | K’ and by (14.2) 
fo| K'+ ~fi| K’ + Since K’ + K"" = K’ — 9(P"") it follows 
from (26.8) that fo ~ fi. 

Ap (28.3). fo| Q’ - P ~ fi | Q’ — P implies fp | K’ ~ f, | K’, and by (15.1) 
"(fo | K’ + K") (f, | K’ + K") in K’ — K’ = 9(P). It follows that 
(ft) in P. 

Let f e Y°"” and let f’ be given by (28.1). Let y” “(f) be the element of 
“"“1H™(P) determined by the cycle y”~""(f’). It follows from (28.3) that 
7 ""(f) is independent of the choice of f’. 


(28.4) ""(f) = D*fe""(f | K’)] where K’ = — D(P). 


(28.5) Given f « we have = and only if there isanf’ 
such that f ~ f’ | Q" — P. 

Proors. (28.4) follows straight from the definition of ~~" and c"*’. If 
¢""(f) = 0 then for every f” « Y°~”” "* such that f ~f” | Q’ — P we have 
~ 0 in P. It follows by (27.6) that there is an f’ « * such 
that f ~f” |Q’ — P ~f'|Q' — P. On the other hand, if such an f’ exists we 
havey""“(f’) = 0 by (27.4), and therefore y"""""(f) = 0. 


29. We assume now that #:(Y) = 0 fori < nif n > 1 and that Y is 7-simple 
for? = 
(29.1) Let” (P) = Ofori=n+1,n+2,---,7 — 1 where x; = 
Given f we have = 0 if and only if there is an f’ 
such that f ~ f’| Q’ — P. 
: This follows from (28.5) and (27.6) applied successively for n + 1, + 
yrs 
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30. Let G and H be two groups forming a group pair with respect to a group 
Z. Given a finite n-chain A" = >>; a:o7 in K’ with coefficients in G and an 
arbitrary (r — n)-chain B”" = >>; 6:7; " in Q’ with coefficients in H, we write 


x(A", = a6.” 


We obviously have 
(30.1) x(A", A” = A*-D(B = D*(A")-B 


It follows therefore from 19, 25, and 26 that "H*°(K’) mod K’ — 9(P) and 
'""H"(P) form a group pair with respect to Z. Similar relations hold for 
"H°(K') mod K’ — D(P) and” "H*"(P). 


31. We make the same hypothesis about Y as in 20. Let fe Y°”. By 
(21.2), (28.4) and (30.1) we have 
(31.1) = g(f, for every mod — 9\(P) 
Now, if P is finite and G = R (see 23 and footnote”) the complex )(P) 
is finite and "*'H"(K’) mod K’ — D(P) is orthogonal to """”""H’(P), since it is 
orthogonal to """H,[D(P)] (see 28). It follows therefore from (31.1) that 
considered as a character of mod K’ — 9)(P) is equal to 
g(f, aa"""). Therefore it follows from (28.5) that 
(31.2) Given f we have g(f, aa"**) = 0 for every «”*'H"(K') mod 
K’ — QP) if and only if there is an such that f = 
rig = F. 
Appenpix I. ON NormaL Mapprnes™ 
Let Y be an arbitrary topological space and let yoe Y. A map fe Y*" will 
be called n-normal if f | = 0 (i.e. if f(x) = yo for K"™"). 
Given an oriented n-cell o? in K", the map f | ¢? defines (if f is n-normal) 


uniquely an element d(f, o/) of 7,(Y). If n> 1 then z,(Y) is abelian and we 
may define the n-chain d"(f) and the (n + 1)-cocycle c”*(f) by 


a'(f) = Liid(f, of)o?, *(f) = ad"(f). 


c(f, will be defined as the coefficient of the (n + 1)-cell inc" (/). 
Given two n-normal maps fo , fi « Y* we take 


a"(fo, fi) = d"(fo) — a"(fi). 


Starting from the definition of 7,(Y) we can prove that c(f, o7*") = 0 if and 
only if f can be extended on o?**. It follows that ‘s 
d"(f) is a cocycle (i.e. c"*(f) = 0) if and only if there is an extension f’ « Y™" 

of f. 


38 The purpose of this appendix is to make clear the position of the results of H. Whitney 
(Duke Math. Jour. 3(1937), pp. 51-55) in the theory developed in this paper. The proofs 
are on the same lines as those of Whitney and may be left to the reader. 
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ip From this, statements analogous to those of 9-12 can easily be deduced. In 
in particular we have 

te ExTENSION THEOREM. Given an n-normal map f « Y*’, the n-chaind"(f) CK’ 
is part of a cocycle in K™ if and only if there is an n-normal extension f’ « Y*'**""* 
Oj J. 

Given two n-normal maps fy, fie such that 
fo | K' = fi | RK’, the n-chain d"(fo ? fi) ws a cocycle in K — K’. Moreover 
i) — 0 in K — K’ af and only if fol K’ + K" ~ K’ + K" rel. 


d K’ + K 4 
; 
II. Mappines or INFINITE (n + 1)-MANIFOLDS 
We assume that 7(Y) = 0 fori < nif n > 1 and that Y is 7-simple for 
y i=1,2ifn=1. In particular we may take Y = 8S”. + 
Let Q”*? be an infinite geometrical cell-complex which is an oriented (n + 1)- “4 
’) dimensional combinatorial manifold. 
2) CLASSIFICATION THEOREM. ‘The homotopy classes of Y°"™* are in (1-1)-corre- 
is spondence with the elements of the group 'H™"(Q""") where mn. = tn(Y). i 
t Proor. Let K”*’ be the dual of Q”*’. Since Q"™’ is a connected infinite | 
0 complex, it is easy to see that °H°(Q"*’) = 0 and therefore, by 25, that } 
"H(K""’) = 0 for every abelian group G. The hypotheses of Class. th. IT ; 
d are thus satisfied and the homotopy classes of Y*" = Y°"” are in a (1 — 1)- 
~ correspondence with the elements of "H,,(K”*’). By 25 this group is iso- 
morphic with ‘H™"(Q"*"). This proves the theorem. 
UNiverRsITY OF MICHIGAN 
ll * Whitney, loc. cit., p. 58. It is easy to verify that d"(f) is part of a cocycle in K if ih 
and only if c***(f) ~0in K — K’; ibid., p. 54. ih, 
1) 
). 
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WEAK TOPOLOGIES OF NORMED LINEAR SPACES © 
By Leon ALAOGLU 
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Introduction 


The concepts of weak convergence introduced by Banach [9, pp. 122, 133] 
are of exceeding interest, but in certain respects are rather restricted. Many 
of the results concerned with these concepts are in general valid only for 
separable spaces. It is natural, then, to turn to the general theory of limits of 
Moore and Smith [2] for a generalization. A unified theory of weak convergence 
is presented in Section 1, together with some of its consequences for weak con- 
vergence of functionals. Of these the principal result is Theorem 1:3, which 
states that the unit sphere of the adjoint of a Banach space is bicompact in the 
weak topology of functionals. This is a generalization of a result given by 
Banach [9, p. 123] which is valid only for separable spaces. 

Characterizations of spaces equivalent or isomorphic to adjoint spaces are 
given in Section 2, and are applied to a class of spaces considered by Dunford 
and Morse [13]. A result on universal spaces is given in Section 3. 

The remainder of the paper is devoted to the development of theories of 
integration and differentiation of abstract functions. Two theories of integra- 
tion are outlined in Section 4, both of which can be obtained from that of 
Birkhoff [11] by replacing unconditional convergence of series by its analogues 
for weak convergence of functionals and elements. The second of these integrals 
can be specialized from that of Gelfand [15] by the addition of a uniformity 
condition. 

In Section 5 a theory of differentiation of functions with values in an adjoint 
space is developed. Instead of defining the derivative as the limit of the 
difference quotients of the function, the class of all ultimate limit points of the 
difference quotients in the sense of weak convergence of functionals is considered. 
For a function f of bounded absolute variation in the sense of Definition 5:1, 
it is shown that these sets are non-null almost everywhere, so that they define 
a multiple-valued function, the “derivative” of f. By using the first of the 
integrals defined in Section 4, it is shown that the fundamental theorem of the 
calculus holds for ‘derivatives’ of absolute continuous functions. Gelfand’s 
result on weak derivatives [15], and the results of Clarkson [12], Dunford and 
Morse [13], Gelfand [15], and Pettis [18] on strong derivatives are derived from 
this result. It is of interest to note that in the derivation of Clarkson’s theorem, 
it is shown that uniformly convex spaces are regular. 


1The numbers in brackets refer to the bibliography. 
252 


| | 
H 
aif ug 


— 


WEAK TOPOLOGIES OF NORMED LINEAR SPACES 953 


These theories of integration and differentiation are finally applied to obtain 
two representation theorems, one for linear functionals on the space of Bochner 
integrable functions, and the other for linear transformations on the space of 
Lebesgue integrable functions to an arbitrary Banach space. 


1. Weak convergence 


It has been pointed out that in the theory of normed linear spaces there 
arise two concepts of weak convergence different from convergence in the norm. 
If the space E* is adjoint to a Banach space E, a sequence (x,) of elements of E 
is said to have the weak limit x if lim, f(a,) = f(x) for every element f of E* 
(9, p. 133]. Similarly, a sequence (f,) of elements of E* is said to be weakly 
convergent if lim, fn(x) exists for every element 2 of E [9, p. 122]. In adjoint 
spaces these two types of convergence are not always identical. 

Banach has used these two types of convergence notably in the study of what 
he has called regular closure. His results in this connection are valid only for 
separable spaces. To remove these separability conditions, we make use of the 
general theory of limits of Moore and Smith [2]. 

The central concept of this theory is that of a directed set. This name is 
given to a class A of elements a, together with a binary relation R between the 
elements of A, if the relation R is transitive, and if further, for every pair a , ae 
of elements of A, there is a third element a such that the relations aRa; , aRae , 
hold. Limits of functions (b.) on A to the real numbers are defined in the 
following manner. 

DEFINITION 1:1. 

(1) lim supe ba = infa (SUPa,za 

(2) lim ba = —lim supe (—ba); 

(3) b = lima ba tf for every « > 0 there is an a, such that |b — ba| < ctf 
aRa, holds. 

These limits are well-defined, and lim, ba is a linear functional; that is, lim. 
(aaa + bbz) = alim, a. + blim,b.. We shall use the term “directed system 
of elements of E’’ for a function on a directed set A to a Banach space E, and 
we shall not specify the directed set whenever it is arbitrary. 

DeFinition 1:2. If G is a total subset of the space E* adjoint to E, and tf (xa) 
is a directed system of elements of E, we shall say that the system is weakly con- 
vergent on G if supa || Le || is finite, and tf lima f(a) exists for every f of G. If 
for some element x of E, f(x) = lima f(xa) for every element f of G, we shall write 


x = w-lim, on G. 


If G is E*, and if the system (z,) is a sequence, this reduces to Banach’s 
definition of weak convergence of elements. For weak convergence of func- 
tionals, the subset G of E** can be taken to be the set Ep of all functionals 
¢(f) = f(x), where z is in E. 

Another example is due to Schauder [6, p. 61]. In a space E with a basis 
(é,) let @ be the set of coefficient functionals in the development x = on @n(X)&n - 
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The sequences weakly convergent on G are then weakly convergent in Schauder’s 
sense. 

The condition that G be total in Definition 1:2 is natural; for otherwise limits 
would not be unique. The condition that the directed system be bounded is 
superfluous if the system is a sequence, and if G is E*; for arbitrary directed 
systems and arbitrary G this need not be so. For, if E is infinite-dimensional, 
a discontinuous, real-valued linear functional ¢ on E can be constructed by the 
use of a Hamel base [9, p. 231] for ZH. Let A be the class of all finite-dimen- 
sional linear subspaces a of HZ. Then A with the inclusion relation > is a 
directed set. But the section of ¢ on @ is continuous, and hence is the section 
on a of a linear functional f, continuous on HE. Thus ¢(x) = lim, f.(z) for 
every x; but the system (f.) is not bounded in view of the next theorem. 

TuroreM 1:1. If the directed system (fa) of elements of E* is weakly con- 
vergent on E, and if f(x) = lima fa(x), then f is linear and continuous on E, and 
|| lim inf. || fo ||. 

The linearity of f is a consequence of the linearity of the limit operation; as 
for the continuity of f, we see that | f(a) | = lima | fa(x) | S lim infz || fo ||-|| |). 

The weak convergence on G of Definition 1:2 defines a topology in E, in which 
the bounded subspaces of FE are completely regular. Let K be a bounded sub- 
set of ZH. Then if o is a finite subset of G, « > 0, and x an element of K, let 


(1:1) = [all 2, in K such that | — 2) | < ¢ for every g in 


THEOREM 1:2. The bounded subspace K of E with neighborhood system (1:1) 
is a completely regular Hausdorff space of character not exceeding No times the 
cardinal number of G. This topology is equivalent to the topology of K defined by 
the directed systems of elements of K weakly convergent on G to limitsin K. Equiva- 
lent neighborhood systems are obtained if G is replaced by any set having the same 
linear closed extension as G. 

By the character of a space is meant the least cardinal number of a basis of 
the open sets in the space. 

It is readily verified that G can be replaced by its linear extension Gz in Defini- 
tion 1:2 without affecting the character of the convergence of any system. 
The proof of the statement that G can be replaced by its metric closure (c is 
similar to the proof of Banach’s Theorem 1 [9, p. 133]. 

Let the element x of K be the weak limit on G of the directed system (a) 
of elements of a subset Ky of K. Then for every g; in a finite subset o of G, 
and for every « > 0, there is an a; such that | g;(z7 — ra) | < € if aRa; holds. 
But there is an a, such that aRa; holds for every 7, and then 2. is in U..(2) if 
aRa, holds. Conversely, if every neighborhood U,.(x) of the element x of K 
contains an element i. of Ko, = w-limi.tiee , Where (ce)R(o1e,) means 
that U..(x) is contained in U,,.,(z). 

We can now replace G by the set Gp of all functionals of the form g/M || g ||, 
with g in G; and M can be chosen so that | go(x) | S 1 if x is in K and gp in Go- 
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Now consider the class Q of all functions ¢ on Gp to the closed interval (—1, 1). 
The class Q is a bicompact Hausdorff space with the neighborhoods 


(1:2) Use) = [all ¢: in Q such that | ¢(g) — ¢:(9) | < ¢ for every g of o], 


where o is a finite subset of Go, « > 0, and ¢ is an element of Q. This space 
has been studied by Tychonoff [8, p. 546], who proves these results, and shows 
that the character of Q is No times the cardinal number of G. But, on com- 
paring the neighborhoods (1:1) with those of (1:2), we see that K is a subspace 
of Q, and hence is completely regular, as is shown by Tychonoff [8, p. 545]. 
Also, the cardinal number of Go does not exceed that of G. 

THEOREM 1:3. The unit sphere K of the space E* adjoint to E, with the neigh- 
borhoods (1:1) for G = E, is a bicompact Hausdorff space. Its character is No 
times the least cardinal number + of a subset of E total for E*. 

Banach has shown that the linear closed extension of a subset H of EF total 
for E* is the whole space E [9, p. 58]. Hence, by Theorem 1:2, E can be re- 
placed by a total subset Go of the unit sphere of E in defining the neighborhoods 
(1:1). K is then a subspace of the bicompact space Q of Tychonoff. More- 
over, K is closed in Q, so that K is bicompact. For, let the element f of Q be 
in the closure of the subspace K. Then every neighborhood U,,(f) contains 
an element f,. of K, and as in the proof of Theorem 1:2, f(x) = lim,. f..(x) for 
every cof Go. But then by Theorem 1:2, the system is weakly convergent, and 
by Theorem 1:1, it has a limit in K. 

Since Gp can be chosen to have minimal cardinal number 7, the character x 
of K does not exceed 7®&). The local character x; of K at f = 0 does not 
exceed x; and Alexandroff [5, pp. 269-70] has shown that from the complete 
system U,.(0) of neighborhoods of f = 0 a complete subsystem V,, can be 
chosen of minimal cardinal number x;. The character x; is infinite, so that 
we can extend the range of ¢ in the V,, to include all positive rational numbers. 
Let H, be the logical sum of all the sets o occurring in the V,.. Then, since 
x: is infinite, the cardinal number of H, does not exceed x;. Furthermore, A; 
is total. For, if the element f of K vanishes on H, the sequence f, = f con- 
verges weakly both to f and to zero, so that f = 0. Hence x = 7No, since 
™ Sx Sx 

When £ is separable, 7 < No, and the space K satisfies the second count- 
ability axiom. Theorem 1:3 has been obtained with this restriction by Banach 
[9, pp. 123-4]. 

A similar restriction occurs in Banach’s treatment of the regular closure. 
We recall here the definitions of regular closure and transfinite closure [9, pp. 
115 ff.]. 

Derinition 1:3. (1) If a limit ordinal, the transfinite sequence (fz | § < 
of elements of E* is said to have the transfinite limit f if the system (f;) is bounded, 
and if for every element x of E, 


lim inf; fe(z) f(x) S lim sup; f;(z). 
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(2) If every bounded transfinite sequence of elements of a subset G of E* has a 
transfinite limit in G, G is said to be transfinitely closed. 

(3) The subset G of E* is regularly closed if for every functional f not in G 
there is an element x of E such that f(x) ¥ 0, while g(x) = 0 for every element 
g of G. 

(4) The subset G of E* is said to be weakly closed on E af every directed system 
of elements of G converging weakly on E has a limit in G. 

Banach [9, pp. 121, 125] has shown that these three closure properties are 
equivalent for linear subspaces G of H* if EZ is separable. In the next theorem 
we show that this equivalence holds in arbitrary Banach spaces. 

THEOREM 1:4. If the subspace G of E is linear, the following properties of G 
are equivalent: 

(1) G is transfinitely closed, 
(2) G is regularly closed, 
(3) G is weakly closed on E. 

The first implication, that (1) implies (2), has been proved by Banach 9, p. 
119]. To show that (2) implies (3), let f be the weak limit on F of a directed 
system of elements of G. Then, if g(x) vanishes for every element g of G, so 
does f(x), so that fisin G. Finally, to prove that (3) implies (1), let (f;) be a 
bounded transfinite sequence of elements of G. Then the sets 


G, = [all fe with » < = < 6] 


lie in a metrically closed sphere K, which is bicompact in the weak topology 
of Theorem 1:3. Hence the intersection of the weak closures of the sets G, is 
non-null [4, p. 259]. Any element of the intersection is in G, and is a transfinite 
limit of the system (f:). 

THEOREM 1:5. If the functional F is linear and continuous on E*, and if when- 
ever the system (f.) of elements of E* converges weakly to f on E, it is true that F(f) = 
lima F(fa), then there is an x in E such that F(f) = f(xo) for every f of E*. The 
converse is also true. 

The proof of this theorem is the same as that of Banach’s Theorem 8 [9, p. 
131], which was proved for separable spaces. 


2. Two characterizations of adjoint spaces 


THEOREM 2:1. If E is a Banach space, the following two conditions on E are 
equivalent: 

(1) There exists a Banach space E, such that E is isomorphic [9, p. 180] to E ; 

(2) There is a total subset G of E* and a constant M > 0 such that for every 
element x of E, || x || S M supying, | f(x) |/\|f ||, where Gx is the linear extension 
of G; and every directed system (xa) of elements of E weakly convergent on G has a 
limit in E. 

If (2) holds, for every linear and continuous functional F on the linear closed 
extension Gi of G there is an element x of E such that F(f) = f(ao) for every f 
of Gic , and such that || x || < M-|| F |]. 

If E is isomorphic to Ey , let 7' be the linear transformation on E to Ey which 
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is fective. Then || 7 || S| Tell T ||. Let @ be the set 
of all functionals f,(z) = (7'x)(y) with y in E,. This set is total, since 
(T'x)(y) = 0 for every y if and only if 7x = 0, which is equivalent to x = 0. 
To obtain the inequality involving the norm in E, we observe that | f,(x) | = 
\(Tx)(y)| ||, so that || f, || ||-\| y ||. Hence we see 
that 


supy | fu(z) |/|| fy || 2 supy | T 


The system (xq) converges weakly on G if and only if the system (7'x,) con- 
verges on E;. But, by Theorem 1:1, this latter system has a limit g in gE. 
and 7” 'g = w-lim, 22 on G. This completes the proof of the necessity of (2). 

If (2) holds for G, it also holds for the linear closed extension G,¢ of G, by 
Theorem 1:2. The functionals ¢.(f) = f(x) are linear and continuous on Gr 
and the totality of these functionals is a total linear subspace of (@ic)* which 
is weakly closed on G. Hence, by Theorem 1:4, it is the whole space (Gic)*, 
so that every F in (Grc)* is ag, , and || z|| S M supyinoze | |/\|F || = 
M \\¢, ||. This transformation carrying x into ¢, is the isomorphism between 
E and (Gic)*. 

Corotuary 2:1. If in (1) of Theorem 2:1 the word “isomorphic”’ is replaced 
by “equivalent” [9, p. 180], (2) may be replaced by: 

(2’) There is a total subset G of E* such that || x || = supy ine, | f(x) |/\\f\I, 
and such that the unit sphere of E is bicompact in the topology of Theorem 1:2 
defined by G. 

If (2') holds, E is equivalent to (Gic)* under the transformation indicated in 
Theorem 2:1. 

In this case, 1 = || 7’ || = || 7” ||, and from the construction of the set G, 
the bicompactness follows from Theorem 1:3. To prove that (2’) is sufficient, 
note that the transformation at the end of Theorem 2:1 is an equivalence in 
this case. 

THEOREM 2:2. If the Banach space E has a basis (é,) such that the series 
Yin Qnén converges in the norm whenever its partial sums are bounded, then E is 
eee to (Gic)*, where G is the set of coefficient functionals in the development 

Spaces of this type have been considered by Dunford and Morse [13, p. 415], 
who showed that the space has an equivalent norm such that for every sequence 
(a;), the sequence (|| D 7.1 @;&; ||) is non-decreasing. Since G is denumerable, it 
suffices to show that the unit sphere of E is weakly compact in the topology 
defined by G. If (2,) is a bounded sequence, the sequences (a;(rn) |) are 
bounded for each 7, so that, by the diagonal process, a subsequence (z,,) can be 
chosen such that a; = lim, a;(ap,) exists for every 7. The series >: aig; then 
converges to an element x of FE, and = w-lim,; zp, on G. Also 


$a 
ent 
em 
om 4 
p. 
ed 
80 
By 
4 
te 
n- 
= 4 
he 
p. 
J 
re 
| W 
ry 
a 
oll 
n n 


258 LEON ALAOGLU 


But isin and || af(é) || || f ||, so that condition (2’) of 
Corollary 2:1 is satisfied. 


3. Universal Banach Spaces 


Theorem 1:3 suggests tht possibility of constructing universal Banach spaces. 
Let K be the bicompact space of Theorem 1:3, and let ¢, = (f(z) | f in K). 
Then ¢, is continuous on K, and is linear in z. If we define || ¢, || to be 
supy in x | |, we see that || z || = ||, so that this transformation is an 
isometry between E and a subspace of the space of continuous functions on the 
bicompact space K. 

When K satisfies the second countability axiom, it is well-known that K is a 
continuous image of Cantor’s ternary set By,. But if the character x of K 
exceeds No , such a result is not known. Stone [14] has constructed the bicom- 
pact Boolean spaces {8, of character 7, consisting of all functions b on a range P 
of cardinal number + with values zero or unity. A basis of %, is formed by 
the finite intersections of the sets 


U, = [all b in &, such that b(p) = 0], 
= B — Uz. 


Stone has shown that a bicompact Hausdorff space of character 7 is a con- 
tinuous image of a closed subset of &,. Let 7 be the transformation effective 
for K. Then the functions ¥,(b) = ¢,(7'b) are continuous on &, linear in z, 
and ||x|| = sup» in@ | ¥z(b) |. This completes the proof of the following 
theorem. 

THEOREM 3:1. If the least cardinal number of a total subset of the adjoint E* 
of the Banach space E is 7, then E is equivalent to a linear closed subspace of the 
space of all continuous real-valued functions f on a bicompact Boolean space 8 of 
character , where f || = sups ing | f(0) |. 

When 7 S No, we can take & to be By, , as was done by Banach [9, pp. 185~7]. 
But if 7 > No, the space & is not uniquely determined by +. One might con- 
jecture that Stone’s theorem could be strengthened to read “every bicompact 
Hausdorff space of character 7 is a continuous image of &, ,” and then we 
would have a stronger Theorem 3:1. 


4. Integration of abstract functions 


Gelfand [15] has given a definition of an integral for abstract-valued functions 
T' on a domain P with a measure function. If for every element f of the adjoint 
E* of the range of 7' the real function f[7'(p)] is integrable in the sense of Lebesgue 
as defined by Fréchet [1], the function T is said to be weakly integrable. If the 
range of T' is the adjoint E* of a Banach space E, one might define 7' to be 
integrable if ¢.[7(p)] is integrable for every x, where, as before, ¢:(f) = f(2)- 
It is here that one can make a construction similar to that of Birkhoff [11]. 
Birkhoff’s construction makes use of the norm topology of the range of 7; a 
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similar theory using weak convergence is a natural extension. First it is neces- 
sary to have a notion of unconditional weak convergence of series. This is at 
hand. For use in the following definition, we observe that the class of all finite 
sets ¢ of positive integers is a directed set with the inclusion relation ¢; D oe. 

Dertnition 4:1. A series >>; f; of elements of the space E* adjoint to a Banach 
space E is said to be weakly unconditionally convergent on E if lime >>i in « fix) 
exists for every element x of E. 

This is the analogue for weak convergence on F’ of Orlicz’ notion of uncondi- 
tional convergence in the norm [21, p. 242]. A similar definition has been given 
by Gelfand [16, p. 246]. 

TaeorEM 4:1. A necessary and sufficient condition that the series >~; fi of 
elements of E* be weakly unconditionally convergent on E is that 


supe fill < +=. 


If this be so, w-lim, >i ino fi exists on E. 

The class of all o is denumerable, so that, if the condition of the definition is 
satisfied, the limit exists semi-uniformly. Hence, by Hildebrandt’s Theorem 
[3, p. 313], there is a o; and a constant M > 0 such that if o contains o,, 
|| Si || < M. Then for any o, || Si || <M + || fi || holds. 
Conversely, if sup. || Dciine Si || < +, the series }>21 | fi(x) | is convergent 
for every element x of E, and lim, )>i ine f(z) = DoPafi(x). It is then a conse- 
quence of Theorem 1:1 that w-lim, >>; in« fi on E exists. 

One can now proceed to replace unconditional convergence by weak uncondi- 
tional convergence in Birkhoff’s definitions and results on the summation of 
series of bounded subsets of E*. At the same time the metric closure B must 
be replaced by the weak closure hB. This is defined to be the set of all limits 
of directed systems of elements of B weakly convergent on E. If B is a bounded 
subset of H*, the set hB is weakly closed on E in the sense of Definition 1:3 (4). 
All of Birkhoff’s results in this connection are valid when so amended. 

The functions to be studied are multiple-valued functions on a domain P of 
the type described by Birkhoff to the adjoint Z* of a Banach space E. Such a 
function is said to be weakly summable relative to the partition Tl = (A;) of P 
if the series of sets }>; T'(A;)mA, is weakly summable in the sense of weak con- 
vergence on £ as just indicated. If T is weakly summable relative to II, the weak 
integral range of T relative to II is defined to be Jy(T) = h[Co(>>; T(Ai)mA,)]. 
The function is then said to be integrable over P on E if it is weakly summable 
relative to some partition II, and if the integral ranges of 7’ have only one point 


in common, which is denoted by / T(p) dm. This integral is characterized 


in the following theorem. 

THEoREM 4:2. A function T on P to E* is integrable over P on E if and only 
if tt is summable relative to some partition II, and if the real functions $.[T(p)] 
are integrable over P in the sense of Lebesgue. 

Thus this integral differs from the modification of Gelfand’s integral indi- 
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cated at the beginning of this section by the additional condition of summability, 
which is a sort of uniformity condition. It can be shown that this condition of 
integrability is weaker than that of Birkhoff, and that when the latter holds, the 
integrals are equal. 

Much use has been made of the transformation y on E to E** defined by the 
relations ¥(z) = ¢,z, and ¢.(f) = f(x). By means of this transformation one 
other integral can be defined. A function 7 on P to the Banach space £ is 
said to be integrable over P on E* if the function ¥[7(p)] with values in E** is 
integrable over P on E*. By Theorem 4:2 this condition can be shown to be 
stronger than Gelfand’s, and weaker than that of Birkhoff. 


5. Differentiation of abstract functions 


In this section we shall consider single-valued functions on a real interval to 
the adjoint Z* of a Banach space £, and we shall be concerned with a theory of 
differentiation for such functions. As in the Lebesgue theory, a notion of 
bounded variation is necessary. 

DEFINITION 5:1. A single-valued function on the interval (c, d) to the Banach 
space E is said to be of bounded absolute variation tf there exists a number M such 
that for every partitionc = a Sa S--- S a, = dof the interval, it is true that 


lisa) fla) || Me. 


There is associated with each type of convergence of infinite series in a Banach 
space # a notion of function of bounded variation. The word absolute used 
here refers to this distinction. 

The derivative of a real-valued function is the limit f’(t) = limaeo [f(t + a) - 
f(t)|/a. If f’(t) exists, it is the only element of the set 


hfall + a) — f()]/a with |a| dl, 


where h denotes the closure of the indicated class of numbers. Conversely, if 
this set contains only one element, f’(#) exists and is that element. 

DerFinition 5:2. Let f(-, t) be a single-valued function on the interval (c, d) 
to the adjoint E* of a Banach space E. If for some t on the interval there exist 
constants M and d, such that || f(-,t + a) — f(-, 0) || < M|a|when|a| <a, 
the set 


o(-,t) = I] Afall [f(.,¢ + a) —f(., )]/a with |a| < dl, 
0<dsd, 


is defined to be the “derivative” of f at t. As before, the letter h denotes the weak 
closure on E of the set indicated. 

If the condition of the definition is satisfied, the sets occurring therein are 
bounded and weakly closed on FE. But these sets are bicompact in the topology 
of Theorem 1:3, and they form a monotone decreasing system. Hence their 
intersection is non-null, in consequence of a lemma on bicompact spaces which 
is well-known in a special case [4, p. 259]. 
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Lemma 5:1. If (Ga) is a directed system of closed non-null subsets of a bicom- 
pact Hausdorff space K, and if further, Ga, C Ga, whenever aRaz holds, then 
is non-null. 

If the sets have no point in common, for every point p of K there is a 
set Ga(p) not containing p, and hence a neighborhood U, of p such that 
U,Ga(p) = 0. These neighborhoods cover K; and since K is bicompact, a finite 
number U>», will cover the whole space. But then if a stands in the relation R 
to the a; corresponding to the points p;, GaU,; = O for all. This is a contra- 
diction. 

TuroreM 5:1. If the function f(-, t) on the interval (c, d) to the space E* is of 
bounded absolute variation, it has a ‘‘derivative’’ $(-, t) almost everywhere, which 
is integrable over (c,d) on E. Except on a set of zero measure, $(-, t) = ¥(-, tg(t), 
where ¥(-, t) is bounded, and g(t) is a Lebesgue-integrable real function. 

The proof depends on a device used by Dunford and Morse [13, p. 419]. 
Let V(t) = supg || f(-, ax) — f(-, ||, where II ranges over all partitions 
c=% 5a --: S a, = t of the interval (c, 4). The function V(é) is the 
total variation of f on the interval (c, ¢), and is non-decreasing. Let V;(t) = 
V(t) +t — c. Then V; is properly increasing, and || f(-, 4) — f(-, || 
| Vi(t.) — Vi(te) |. The function V; has a derivative except on a set A of zero 
measure, so that if t is not in A, there is a constant M, such that | Vi(¢ + a) — 
Vi(t)| < M,|a|for every a. Consequently f has a “derivative” ¢(-, t) except 
on A; and since || (-, é) || S Vi(), the function ¢ satisfies the condition of 
summability in Theorem 4:2. 

If g lies in the set $(-, ¢), there exists a directed system (a.) of real numbers 
such that g = w-lim, [f(-, t + ae) — f(-, )]/a., and such that lim, a. = 0. 
Now let f(x, #) denote the value of the functional f(-, ¢) at the point z of the 
space E. Then f(z, ) as a function of ¢ is of bounded variation for each z, and 
so has a derivative f’(x, t) except on a set A, of zero measure. It follows that 
if tis not in the set A + Az, g(x) = f’(z, 2), so that o(z, t) = f’(z, t) except 
on the set A + A, of zero measure. Hence ¢(z, ¢) is integrable over (c, d) on E, 
since, as is well-known, the function f’(x, t) is Lebesgue-integrable. 

The function Vj(é) is integrable and nowhere zero, so that ¥(-, t) = 
¢(-, t)/Vi(t) is bounded. This completes the proof of the theorem. 

DeriniTIon 5:3. Let f(t) be a single-valued function on the interval (c, d) to 
the Banach space E. Then f is absolutely continuous if for every « > 0 there 
exists an » > 0 such that > || f(ax) — f(bx) || < «€ of the intervals (a, , bx) are 
disjoint and have the sum of their lengths less than n. 

It is readily verified that if the function f(-, ¢) on the interval (c, d) to E* is 
absolutely continuous, it is of bounded absolute variation, and that the func- 
tions f(x, t) are absolutely continuous for every x in EZ. It was shown in the 
proof of Theorem 5:1 that if ¢(z, ¢) is the “derivative” of f, o(z, t) = f’(z, 4) 
almost everywhere. But if f(z, t) is absolutely continuous, f(x, t) — f(z, c) = 


/ f'(x, t) dt. This suffices to prove the following theorem. 
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TueoreM 5:2. If the function f(-, t) on the interval (c, d) to E* is absolutely 
continuous, it has a “derivative” almost everywhere which is integrable over (c, d) 
on E. For each x, t) = f' (ax, t) almest everywhere, and 


We now turn to the question of when the “‘derivative”’ is single-valued almost 
everywhere, and when it possesses stronger integrability properties. 

Derinition 5:4. (1) If the subset G of the adjoint E* of the Banach space E 
is total, the function f(t) on the interval (c, d) to E is said to have a weak derivative 
f'() on G at tif = w-lim,— [f(t + a) — f(d]/a on G. 

(2) The function f(t) on the interval (c, d) to the space E has a strong derivative 
at tif = Tim, [f(t + a) — in the norm. 

A preliminary observation can be made. Suppose that the “derivative” 
¢(-, t) of a function f(-, ¢) with values in an adjoint space E* is single-valued 
at a point t. Then ¢(-, ¢) is the weak derivative of f(-, t) on E at t. For, if 
¢(-, t) exists, the set of numbers [f(r, t + a) — f(x, t)]/a is bounded for each 
element x of E, so that if lim, a, = 0, there is a subsequence (a,,) such that 
lim, [f(a, + @n,) — f(x, t)]/an, exists. The sets 


h [all [f(-, ¢ + an,) — f(-, D]/an, with k = m) 


have the point ¢(-, ¢) in common, so that there is a directed set (aa) of the numbers 
a», such that lim, aa = 0, and such that ¢(-, ¢) = w-lim, [f(-, a) — f(-, 
on FE. Consequently ¢(z, t) = lima [f(z, t + aa) — f(a, = lim; [f(z, 
t + an,) — f(a, t)]/an, , 80 that o(x, t) = limao [f(z, t + a) — f(z, t)]/a for every 
element x of EZ. This result is used in the proof of the next theorem, which 
was obtained by Gelfand [15, p. 245]. 

THEOREM 5:3. If the function f on the interval (c, d) to E* is of bounded abso- 
lute variation, and if E is separable, the ‘derivative’ $(-, t) is almost everywhere 
the weak derivative of f on E. 

If furthermore the space E* is separable, the “derivative” of f is almost every- 
where the strong derivative of f. This function @ is then Bochner integrable, and 


t 
af f is absolutely continuous, f(-, t) — f(-, ¢) = o(-, dt in Bochner’s sense. 


In view of what has just been shown, it will suffice for the first part to prove 
that ¢(-, ¢) is single-valued almost everywhere. If then EZ is separable, let (<») 
be a sequence everywhere dense on E. The function ¢(z, , t) is single-valued 
except on a set A, of zero measure, by Theorem 5:2; hence ¢(-, é) is single- 
valued except on >>, An. 

To prove the second part of the theorem, we first observe that if E isseparable, 
|| $(-, @) || is integrable. For, if (zn) is the sequence everywhere dense on £, 
|| = sup, | (an, t) | / || an || almost everywhere, so that || ¢(-, 2) || 
is measurable. But, by Theorem 5:1, || $(-, ¢) || is bounded above by an in- 
tegrable function, so that it is itself integrable. Now let n@,(-, t) = $(-) 4) 
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if || || and let $.(-, 2) = otherwise. Then = lim, nga(-, 
in the norm almost everywhere, and ¢,(z, /) is measurable for every element x 
of E. We denote by ¢,'(U) the set of numbers ¢ such that ¢,(-, 4) lies in the 
subset U of E*. Then if U is a member of the neighborhood system (1:1) 
used in Theorem 1:3, ¢, (U) is measurable. But if E is separable, the sphere K 
of that theorem satisfies the second countability axiom, so that if U is a Borel 
set in the topology of Theorem 1:3, ¢;'(U) is measurable. We now note that 
a closed subsphere Ko of K is weakly closed, so that Ko is a Borel set in the 
weak topology; and since open subspheres of K are sums of a denumerable 
number of closed subspheres, these too are Borel sets in the weak topology. 

If E* is separable, every subset of K which is Borel measurable in the metric 
topology is a Borel set in the weak topology. Hence K can be covered by a 
sequence of disjoint subsets (B,) each Borel measurable in the weak topology, 
and each of diameter not exceeding 1/m. Let gy lie in B, , and let dam(t) = gx 
if ¢ lies in ¢,'(B,). Then the functions ¢, are Bochner measurable, and 
limm Gnm(t) = On(-, ) in the norm, so that the ¢, , and hence ¢ are Bochner 
measurable [10, p. 263]. Since || ¢(-, ¢) || is integrable, ¢(-, ¢) is Bochner 
integrable [10, p. 265]. But Birkhoff has shown that a Bochner integrable 
function is integrable in his sense to the same value [11, p. 377], and it has been 
indicated in Section 4 that Birkhoff integrable functions are integrable over E 


t 
to the same value. Consequently, f(-,t) — f(-,c) = i ¢(-, t) dt in Bochner’s 


sense, if f is absolutely continuous. The proof is completed on noting that 
indefinite Bochner integrals have their integrands as their strong derivatives 
almost everywhere [10, p. 269]. 
If f is not absolutely continuous, one can apply a criterion of Dunford and 
t 


Morse [13, p. 417] to show that g = f — / ¢ dt has a strong derivative vanishing 


almost everywhere. If ¢ > 0, there a partition I = (@ =~cSam2Z-:-:- 8 
a, = d) such that Vg) || g(-, ai) — g(-, || + But there 
exist x; of unit norm such that || g(-, ai) — g(-, as1) || S | g(a, ai) — g(xi, a) | 
+ < Vig(a:, -), Ei] + where E; denotes the interval (a;-1 , 
The criterion cited can be applied to the real singular functions g(x, t) to yield 
open sets G; with measures each not exceeding ¢/ n2’, such that V[g(z;, -), Ei] = 
Vig(a:, -), GJ. Also, if x is of unit norm, V[g(z, -), G] < Vig, GJ. Hence, 
if@’ = G;, < and V(g) Vig, + «/2’; and if G = @’, 
mG < e,and V(g) = V(g,G@). Thus the criterion cited is satisfied, and g has a 
strong derivative equal to zero almost everywhere, so that f has a strong deriva- 
tive as stated in the theorem. 

Before proceeding further with the question of strong derivatives, we prove 
some lemmas on uniformly convex spaces. A Banach space E is said to be 
uniformly convex if for every ¢ between zero and two there is a 6(e) > 0 such that 
+ y|| 2(1 — 6(€)) whenever || x || = || y|| = 1, and||z — y||2 « 
(12, p. 396]. We shall prove that a uniformly convex space E is reflexive; that 
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is, if @ is in the second adjoint #** of E, there exists an x in # such that ¢(f) = 
f(x) for every element f of E*. 

Lemma 5:2. If ¢ belongs to the second adjoint E** of the Banach space E, there 
exists a directed system (tq) of elements of E such that || tq || = || ||; and such 
that o(f) = lima f(a) for every element f of E. The directed set A can be chosen to 
be independent of 9. 

This lemma is due to Goldstine [17, p. 128]. 

Lemma 5:3. If E is uniformly convex, so is E**. 

Let ¢1 , $2 be in E**, and let || ¢: || = || ¢2|| = 1. By Lemma 5:2, there exist 
two directed systems (ra), (ya), such that || za || = || ye || = 1, lime f(z,) = 
oi(f), and lima f(ya) = ¢2(f). Then, if = || ¢: — || , itis true that < liminf, 
|| 2a — Ya\|. Hence there is an a, such that || t2 — ya || = €/2 if aRa, holds. 
But then, since ¢:(f) + ¢2(f) = lima f(ta + Ya), it follows that || ¢i + ¢e || < 
lim inf, || 22 + Ye || S 2(1 — 6(e/2)), so that H** is uniformly convex. 

Lemma 5:4. Let the subset G of the space E* be linear and total, and such that 
\| || = sup, ing | g(x) |/||g ||. Then, if E is uniformly convex, if x = w-lim, 
Lq on G, and if lima || a || = || x || , 2 7s true that lim, || x — ra || = 0. 

For if not, there is an e > 0 and for each @ an a, such that || x — za, || 2 «¢, 
and such that aa holds. It is then readily verified that x = w-lim, 24, 
on G, so that it can be assumed that for all a || 2 — ra|| = €. It will also be 
sufficient to assume that || x || = || 2.|| = 1. By the same method as that 
used to prove Theorem 1:1, it can be shown that if x = w-lim, x, on G, || «|| < 
lim inf, || 2a ||. Then, since 22 = w-lim, (x + 2.) on G, it follows that 2 = 
|| 2a || S liminf, || 7+ 24 || S 2(1 — 6(e)) < 2, a contradiction. 

TuHroreM 5:4.? If the Banach space E or its adjoint E* is uniformly convex, 
then E is reflexive. 

Suppose first that EF is uniformly convex, and let @ be in E**. Then, by 
Lemma 5:2, there exists a system (x) of elements of E such that || ze || = ||¢||, 
and such that ¢(f) = lima f(a). Let us set o:(f) = f(x), o(f) = F/@), and let G 
be the set of all F;. Then, by Lemma 5:3, E**, G, ¢, and the directed system 
(¢z,) satisfy the conditions of Lemma 5:4, so that lima || ¢z, — ¢@ || = 0. But, 
if this be so, we can choose a sequence (z,) of the elements xz. such that lim, 
|| @z, — @|| = 0. Consequently, the sequence (z,) converges to an element 
x of EH, and ¢ = ¢,, so that E is reflexive. 

If E* is uniformly convex, then E* is reflexive. But Pettis [18, p. 421] has 
shown that this implies that E is reflexive. It should be noted that FZ will be 
reflexive if, instead of Z or E* being uniformly convex, one of E or E* is equiva- 
lent to a uniformly convex space. 

We can now summarize the results of Clarkson [12, p. 407], Pettis [18, p. 427], 
and Dunford and Morse [13, p. 415] in the following theorem, which is a corollary 
of Theorem 5:3. 


? Proved by D. Milman, On some criteria for the regularity of spaces of type (B), Comp- 
tes Rendus de |’ Academie des Sciences de |’ URSS, vol. 20 (1938), pp. 243-246. See also 
B. J. Pettis, Duke Journal of Mathematics, vol. 5 (1939), pp. 249-253. 
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TuroreM 5:5. Each of the following conditions is sufficient that a function f on 
an interval (c, d) to a Banach space E which is of bounded absolute variation should 
have a strong derivative almost everywhere: 

(1) Every linear, closed, separable subspace of E is reflexive; 

) E is reflexive (18, p. 427]; 

(3) Eis uniformly convex [12, p. 407]; 

(4) E* is uniformly convex; 

(5) E has a basis (En) such that the series Dondnén converges in the norm whenever 
its partial sums are bounded (13, p. 415]. 

By Theorem 5:4, if either of E or E* is uniformly convex, then E is reflexive. 
If E is reflexive, then E satisfies condition (1) [18, p. 423]. Hence to prove the 
sufficiency of conditions (2), (3), (4), we need only prove that (1) is sufficient. 

If f is of bounded absolute variation, it is readily verified that f can have at 
most a denumerable number of discontinuities. It follows that the values of 
f(t) lie in a separable subspace E» of EH, which can be assumed to be linear and 
closed. If (1) holds, E, is reflexive, and hence one can regard f as having values 
in E*. But Eo” is a separable space adjoint to the space £4, so that, by 
Theorem 5:3, f has a strong derivative almost everywhere. To prove that (5) 
suffices, we use Theorem 2:2. £ is then separable, and is isomorphic to the 
adjoint Zo of a Banach space Ey. From Theorem 5:3 it follows that f has a 
strong derivative almost everywhere. 


6. Representation of linear operations 


In this section two representation theorems are proved. The first of these 
gives the form of the linear and continuous functionals on the Banach space 
X of all Bochner integrable functions g on an interval (c, d) to a Banach space E. 


The norm in % is defined by the equation || g || = [ || g(t) || dt (10, p. 272]. 


The result given reduces to the well-known representation of linear continuous 
functionals on the space (L) of Lebesgue integrable functions. 

THEOREM 6:1. If the functional & is in %*, there exists a function $(-, t) on the 
interval (c, d) to E*, which is bounded and integrable over (c, d) on E, and is such 
that ¢|g(t), t| is integrable for every g of ¥, and 


with 
|| || = ess sup, || (-, || 
let K(t, 2, r) = x if t < r, and let K(t, z, = 0 otherwise. Then for each r 
m (c, d) and « in E, K(., 2, r) isin ¥ and || K(-, 2, r) || = ||z||-|r —e|. The 
function p(-, r) defined by the equation p(z, r) = ¢[K(., x, r)] has values in E*, 


and || p(-, r) — p(-, s) || S$ !|€||-|r — s|. Hence p(-, é) is absolutely con- 
tinuous, so that by Theo 5:2 it has a “derivative” $(-, ¢) almost everywhere, 
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such that || ¢(-, 4) || S || € || almost everywhere. For each r and z, ¢[K(t, z, r),t] 
is integrable with respect to t, and é[K(-, 2, r)] = [ ¢[K(t, x, r), t] dt. It follows 


that if o is a step-function in %, &(c) = g[o(t), t] dt. Also | ¢[g(t), li < 


|| g(t) ||-|| € || almost everywhere. If g is in %, there exists a sequence (c,) of 
step-functions of X, such that g(t) = lim,co,(¢) in the norm of E except on a set 
Ao of zero measure, and such that g = lim, o» in the norm of X (10, p. 272). 
Let (am) be the sequence of values of the step-functions o, ; then $(zm, @) is 
single-valued and measurable except on a set Am of zero measure, so that lim, 
olen), = except on But | Al] g@ || almost 


everywhere, so that ¢[g(é), is integrable. Consequently é(g) = / ¢lg(t), dt, 


since 
d 
lim sup, | — ox(0), dt lim supa || 9 — |] =0. 


To determine the norm of £, we note that | &(g) | S ess sup; || ¢(-, 4) ||-|! ¢|| 
and that || ¢(-, é) || 2 || & || almost everywhere. 

THEOREM 6:2. If T is a linear and continuous transformation on the space (L) 
of Lebesgue integrable functions on the interval (c, d) to the adjoint E* of a Banach 
space E, there exists a bounded function $(-, t) on (c, d) to E* integrable over (c, d) 
on E such that for every function g in (L) 


d 
Ty = at, 
with 
|| = ess supe 9 Ih 


Let K(t, r) = 1 if ¢ S 1, and let K(t, r) = 0 otherwise. Then K(., 7) isin 
(L) for every r in (c, d), and || K(-,r) — K(-, s) || = |r — s|, so that if p(-,r) = 
TK(-, r), || p(-, r) — s) || || 7 — s|. Hence p is absolutely 
continuous, and by Theorem 5:2, it has a “derivative” $(., ¢) on (c, d) to E* 
integrable on EF, and such that || ¢(-, @) || < || 7'||. If g is an element of (JL), 
¢(-, t)g(t) is weakly summable, and if z lies in E, ¢(z, t)g(é) is integrable. Hence 


¢(-, é)g(d) is integrable over (c, d) on E, and if g is a step-function, Tg = [ g(t) 
¢(-, t) dt. But step-functions are everywhere dense in (L), so that if f = 79, 
fle) = at. Hence Tq = 0 at, and < 0 | 


|| || ess supe || || , which determines the norm of 7. 
6:1. If 7 is a linear continuous transformation on (L) to the Banach 
space E, there exists a bounded function $(., t) on (c, d) to E** integrable over (c, 4d) 
d 


on E* such that, if x = Tg, and $.(f) = f(z), then ¢. = / g(t)o(-, t) dt, and 
|| T || = ess sup, || ¢(-, é ||. 
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] To prove this corollary, we make use of the transformation ¢.(f) = f(x), so 
that 7' can be regarded as having values in E**. 

The results of Theorems 5:3 and 5:5 can be applied if the range E of T satisfies 
conditions which are sufficient that functions of bounded absolute variation 
should have strong derivatives. For such transformations the function ¢(-, ¢) 
always has its values in H, and the integral of the representation is a Bochner 
integral. 

caine it should be noted that the results of Section 5 can be obtained 
without trouble for functions of intervals in a Euclidean space; the proofs depend 

on the validity of these results for numerically-valued functions. 


£ 
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